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(57) .Abstract: The presen! invention relates to the field of plan! moiecalar biology, more panicuiariy to the root-specific gene 
exp.ression in piants. The invention provides nucleic acids for a novel tra^sc^ip^ionaI regulatory rooi-spexific promoter and nucleic 
acid and protein sequences coding for a Psew LRR receptor-kinase protein, fwther specified as a root cSavata 1 homoiog (RCHl). 

^2 Further provided aie composilioris comprising tiacleic acids, polypeptides, antibodies aiid vectors. The invention furtiier provides for 
methods for modifying ceil fate and/or plant development and/or plant morphology and/or plant biochemistry and/orpiant physiology 
comprising ihe modification of expression in particular cells, tissues or organs of a plant of the novel LRR rexeptor-like kinase 
or comprising the expressing of a gene of interest under the control of the novel transcriptional regulatory root-specific promoter. 

)^ Funher are provided corapounds interacting witti the new polypeptides for use as herbicides or grovjth regulators. 
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A NOVEL ROOT SPECIFIC PROMOTER DRIVING THE EXPRESSION OF A 
NOVEL LRR RECEPTOR-LIKE KINASE 

FIELD OF THE INVENTION 

5 The present invention relates to the field of plant molecular biology, more particularly to 
the root-specific gene expression in plants. The isolation of a root specific operon, 
comprising a transcriptional regulatory promoter that contributes to root-specific gene 
expression and Its operably linked gene encoding a novel Leucin Rich Repeat (LRR) 
receptor-iike kinase, is disclosed. Said transcriptional regulatory promoter may be used 

10 for driving root-specific expression of at least one gene of interest in a transgenic plant 
and said encoded LRR receptor-iike kinase gene may be used to alter the features 
and/or to confer a selective advantage to transgenic plants. 

BACKGROUND TO THE INVENTION 

15 Plant promoters 

Initiation of transcription is generally understood to be the predominant conlrolling factor 
in determining expression of a gene. The transcriptional control elements, which may 
interact with DNA binding proteins, are generally embedded in the sequence 5'-fjanking 
or upstream of the transcribed gene. These DNA sequence elements promote the 
20 fornnatlon of transcriptional regulatory complexes that either activate or repress the 
expression of the gene 3' downstream of the promoter. 

The sequences of the promoter vary in length and base pair composition from gene to 
gene. Within the sequence of the promoter, many regulatory sequence elements can be 
embedded that control the promoter activity. These specific elements, called DNA- 

25 boxes, contribute to the promoter a defined feature or a defined activation pattern. Many 
of these boxes are recognition sites to which regulatory transcription factors can bind 
and are part of a tight control mechanism of the promoter. Numerous plant promoter 
DNA boxes have been described, such as tissue-specific boxes (Chaubet et ai. 1996), 
pyrimidin boxes (Gubler & Jacobsen 1992), which influence the level of expression or G- 

30 boxes (Dolferus et al. 1 994) which reduce promoter activity by cold or dehydratation. 
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Constitutive promoters are promoters that drive the expression of genes in a continuous 
manner, whereas ubiquitous promoters drive the expression of the gene throughout the 
ent.rc plant. 

ELxpression of heterologous DNA sequences in a host is dependent upon the presence 
5 d an operably linked promoter that is also present in that host, whereby the choice of 
pfomoter will determine when, where or how strong the heterologous DNA sequence will 
IX' expressed in the host organism. Frequently those skilled in the art desire to limit the 
expression of the transf acted DNA sequence to a certain time-period (e.g. to a particular 
pTuiso of plant development), to a defined tissue of the host (e.g. leaves), or to a certain 
10 expression level. Controlling the expression of the heterologous genes in transgenic 
pianis is considered to provide several advantages to the plant over ubiquitous and 
Lxmit.tijtive expression. There is a broad need in many industriai appiications for 
tr.jnscnptionat Control efernents capable of driving tissue specific gene-expression in 
plants !f is therefore important that such control elements are continuously provided arsd 
IS v\i\ new control elements with their specific transcriptional features are isolated and 
characterized. 

Promoter regulation 

ProTTKiters from both monocots and dicots that have a root root-preferred expression 
:c putiern have already been described (Fowler, Bernhardt, & Tierney 1999;Hwang & 
Gocxirnan 1995;Lagarde et al. 1996;Okada et a!. 2000;Okumura et ai. 1994;Pysh at al. 
1999, Schneider et ai. 1998;Vidmar et al, 2000;Wanapu & Shinmyo 1996;W!nicov 
?O0O)WO9007001 (Baszczynski et al. 2000;Borg et al. 1997;Masuda, Sakuta, & Satoh 
1999. Webb et ai. 2000) (de Pater & Schilperoort 1992;Ely, Evans, & Schuch 
25 2000.Goddemeier, Wulff, & Feix 1998;Kikuchi et al. 1 999,Suzuki, Fowler, & Tierney 
i993.Unbe et al. 1998;Woo & Hawes 1997;Xu et al. 1995) (Yamamoto et ai. 1991) 
(Gwitch et al. 1997;Keller & Lamb 1989;Lauter 1996;0!son, Oetiker, & Yang 1995;van 
ckK Zaal et al. 1991). Many of them are root-preferred rather then root-specific and are 
expressed at low levels in tissues other than the root. These promoters are not suitable 
30 for industrial applications where it us of the utmost importance that the heterologous 
product is expressed in the root only. As an example, ectopic expression of cell cycle 
control elements to obtain growth arrest in some parts of the root (e.g. lateral roots). 
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should be strictly regiiiated to that tissue in order not to interfere with the overall grovyth 
of the plant. In contrast, other promoters are strictly and exclusively confined to root- 
specific expression and can therefore be valuable for some applications demanding tight 
control of the expression. In this group of root-specific promoter subgroups can be 

5 distinguished based on the root-tissue they are expressed in. The HRGPntS gene of 
tobacco for instance is only expressed in the perlcycle and the endodermis of the root, 
more specifically in the discrete subset of cells, involved in the initiation of the lateral 
roots (Keller et al. 1989). This makes the promoter of HRGPntS highly specific. Another 
example is the XSP30 sequence, which is only activated in the xylem of the root. 

10 For persons skilled in the art it is of importance to have at their disposal a wide variety of 
root-specific promoters with different root tissue-specificity from which they can choose 
to confer a specific feature to the transgenic plant. 

Many of these root-preferred promoters are not only limited to tissue specific expression, 
but are also subjected to a tight controi mechanism deciding when the promoter should 

:i5 be active. This decision depends on the function and the effects of the gene product 
that is switched on/off by the related promoter. Some gene products are switched on/off 
at a certain developmental stage of the plant, at a certain moment of the cell cycle, in 
stress situations originating from weather factors, upon injury caused by pathogens or 
upon a change in the environmental resources. A typical example is the ARSK1 gene of 

20 Arabidopsis thaliana, that is activated when the roots are exposed to air during growth 
(dehydration) or by treatment of roots with abcisic acid or salt (Hwang et af. 1995). 
However, for many industrial and agronomic applications, where the aim is to confer a 
special feature at any time of the developmental, environmental or physical state of the 
plant, a constitutive promoter will be needed. For example when the gene product of the 

25 transgene is to be isolated and purified from the plant for commercial purposes. 

It is important for the persons skilled in the art to have at their disposal a wide variety of 
promoters with different time-control features from which they can chose to confer a 
specific feature to the transgenic plant. 

30 Piant root development 

Root development is an essential determinant of plant growth and crop yield since the 
root is the main channel to extract nutrients from the environment. Any changes in the 
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root that contribute to more functional organ tissue will result in higher crop yield. Also 
any regulatory element that is Involved in the root development and whose control is 
subjected to developmental events can contribute better yield properties to a transgenic 

plant. 

5 Cells of the shoot and root meristems are mltoticaliy active and they extend the 
undifferentiated tissue pattern of the embryo as a pool of cells from which cells can 
differentiate into several tissues of the adult plant. This process continues throughout the 
life span of the plant. It is dear that the root meristenri plays a pivotal role in the formation 
of strong roots and therefor also in affecting the growth and crop yield. 

10 In root meristem, the emerging picture is that the integrity of the root meristem is kept by 
balancing cell proliferation and cell differentiation, and that differentiation-inhibiting 
signals originating from the quiescent center are involved (van den Berg et ai. 1 997). 
Inhibition of differentiation only occurs in cells that contact the quiescent center, which 
reveals that short-range or contact-dependent signals are involved in this process. 

15 However which compounds play a role in this process and whether positional cues 
guiding root ceil fate are targeted via plasmodesmata or via the cell wall remains 
unknown (van den Berg et al. 1997). 

In the shoot meristem more progress has been made in characterizing molecular 
components involved in meristem maintenance. Clavata (civ) mutants accumulate 

20 massive pools of undifferentiated cells in the central zone of shoot and floral meristem. 
The CLAVATA genes show a shoot and flora! meristem specific expressionpattern. Root 
meristems of c/v-mutants are unaffected (Clark, Running, & Meyerowitz 1 993). The 
CLAVATA loci {CLV1, CLV2 and CLV3) appear to promote the transition tov.'ard 
differentiation of cells at the shoot and floral meristems, and/or to restrict the proliferation 

25 of cells at the center of these meristems (Clark et a!. 1993). CLV1 acts with CLV3 and 
WUSCHEL (WUS) and SHOOT MERISTEMLESS (STM) to maintain the integrity of the 
central zone which acts as a reservoir of stem cells. A model for stem-cell maintenance 
in the root meristem and in the shoot meristem is depicted in Figure 1 . 

30 CLV1 homologs and CLV1 -redundancy 

The CLAVATA 1 gene resides on the Arabidopsis thaliana chomosome 1. Clv1 mRNA is 
found in the L3 tunica cell layer of the central zone and encodes a protein with ieuclne- 
rich repeats (LRR, of a type found in a number of proteins, notably the products of 
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several plant-pathogen resistance genes), a putative serin/threonin kinase domain and a 
predicted extracellular receptor domain. Expression of the c(v1 gene is initiated in the 
heart stage of the embryo, when cotyiedonary primordia are apparent, and is 
ir>depcndent of the STM (SHOOT MERISTEMLESS) activity (Long & Barton 1998). 

s CLAVATA1 is a leucine-rich receptor-like kinase. The Arabidopsis thaliana genonne 
protect has so far identified over 50 LRR transnnennbrane receptor serine/threonine 
kinases. It can be estimated that the total Arabidopsis ttialiana genome contains In the 
order of 100 family members. Other members of the CLAVATA family are ERECTA (Ef?) 
(Torn et al. 1996), BRASSINOLIDE INSENSITVE (BRI) (Li & Chory 1997) and HAESA 

10 (H4£) (Jinn, Stone, & Walker 2000), genes which have a function in plant development. 
Analogues of the Arabidopsis thaliana CI-AVATA genes were also found in soybean 
(Glyctn© max). They were isolated from the wild type and the fasciating mutant of 
soyt>ean and were designated GmCLVM and GmCLVI B (Yamamoto, Karakaya, & 
Knap 2000). 

IS W «a the aim of the present invention to isolate novel promoters and genes which are 
useful for altering features of transgenic plants and thus confer selective advantage to 
transgenic plants, 

AH me aims of the present invention have been met by the fonfnulation of the following 
preforred embodiments of the current invention. 

20 

SUMMARY OF THE INVENTION 

In the present Invention a DNA operon was isolated from Arabidopsis Hiaiiana, which 
comprises a novel root-specific promoter, driving a gene encoding a Leucine Rich 
Repeat (LRR) receptor-like kinase. Unexpectedly, the inventors discovered that the new 
:^ opwon was specif icaiiy expressed in roots and more surprisingly in the meristem of the 
main and lateral roots, in the main and lateral root vascular tissue and in the lateral root 
pnfnofdsa. 

The surprisingly specific expression pattern, as well as the very strong expression level, 
makes the newly found promoter a very attractive and selective tool to drive root-specific 
30 expression of any gene of interest The ability to target meristem cells of the root makes 
this promoter clearly distinct from the root-specific promoters presently known in the art. 
Another surprising aspect of the invention is the identification of the new LRR receptor- 
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like kinase gene, which is the first gene identified in roots shovi/ing homology with a 
clavata-gene. This gene is specifically expressed in roots in a specific developmental 
stage of the root cells (nanneiy in the meristematic phase) and can therefore contribute 
agronomic interesting features to a transgenic plant when transfected herein. Therefore 

5 the root-specific promoter is further denominated as ROOT CLAVATA HOMOLOG 1 
promoter (RCH1 prom) and the LRR receptor-like kinase as RCH1 . 
Accordingly, the invention embodies an isolated DNA sequence with nucleotide 
sequence as given in SEQ ID NO 1 (Figure 5) comprising a novel root specific regulatory 
promoter sequence with nucleotide sequence as given in SEQ ID NO 2 (Figure 6) or 

10 SEQ ID NO 18 (Figure 13), and an isolated DNA sequence with nucleotide sequence as 
given in SEQ ID NO 3 {Figure 8), encoding a novel ROOT CLAVATA HOMOLOG 1 
gene, with amino acid sequence given in SEQ ID NO 4 or SEQ ID NO 19 (Figure 9), 
Furthermore the invention embodies a method for modifying cell fate and/or plant 
development and/or plant morphoiogy anci/or biochemistry and/or physioiogy comprising 

15 the modification of expression in particular cells of the root of a particular transgene 
operably United to the RCH1 promoter sequence of the present invention. Provided in 
the current invention are methods to effect expression of any gene of Interest in 
particular cells of the plant root. The present invention also relates to a method to effect 
expression of a ROOT CLAVATA HOMOLOG 1 protein or a homologue or a derivative 

20 thereof in a plant cell, tissue or organ. The present invention clearly extends to any plant 
produced by the Inventive methods described herein. 

DETAILED DESCRIPTION OF THE INVENTION 

One of the technical problems underlying the present invention is to provide regulatory 
25 elements that are active in the root only and that show no expression in other tissues. 
Additional to the highly selective and tissue-specific regulation of a promoter, other 
features such as very strong activity, continuous activity, and termination of activity at a 
certain developmental stage of the cells, would contribute an unique combination of 
regulatory elements to specifically regulate the expression of a gene of interest in the 
30 roots of a transgenic plant, thereby providing a means for modifying ceil fate and/or plant 
development and/or plant morphology and/or biochemistry and/or physiology. Also the 
isolation of root-specific DNA sequences encoding proteins that could play a role in root 
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development is envisaged for use in modification of ceil fate and/or plant development 
and/or plant morphology and/or plant biochemistry and/or lant physiology. 
In a first embodinnent, the present invention relates to an isolated nucleic acid comprising 
a transcriptional regulatory root promoter sequence and/or a nucleic acid sequence 
5 encoding a novel LRR receptor-like kinase (protein) or an immunologically active and/or 
functional fragment of such a regulary element or protein (kinase) selected from the 
group consisting of: 

(a) nucleic acid comprising at least part of the DNA sequence as given in any of 
SEQ ID NOs 1 , 2, 3 or 1 8, or the complement thereof, 
10 (b) nucleic acid comprising the RNA sequences corresponding to at least part of any 
of SEQ ID NOs 1, 2, 3 or 18, or the complement thereof, 

(c) nucleic acid specifically hybridizing, preferably under stringent conditions, with 
the nucleotide sequence as defined in (a) or (b), 

(d) nucleic acid encoding a protein comprising the amino acid sequence as given in 
15 SEQ ID NO 4, SEQ ID NO 5 or SEQ ID NO 19. 

(e) nucleic acid which is degenerated as a result of the genetic code to a nucleotide 
sequence encoding a protein as given in SEQ ID NO 4, SEQ ID NO 19 or to a 
nucleotide sequence as defined in (a) to (d), 

(f) nucleic acid which is diverging due to the differences in codon usage between 
20 the organisms to a nucleotide sequence encoding a protein as given In SEQ ID 

NO 4, SEQ ID N0 1 9 or to a nucleic acid sequence as defined in (a) to (d), 

(g) nucleic acid encoding a protein as given in SEQ ID NO 4, SEQ ID NO 19 or as 
defined in (a) to (d), which is diverging due to the differences between alleles 
encoding a protein 

25 (h) nucleic acid encoding a fragment of a protein encoded by a DNA sequence as 
given in SEQ ID NO 1 or SEQ ID NO 3 or encoding a fragment of a protein 
encoded by a nucleic acid as defined in any one of (a) to (g), wherein said 
fragment comprises the sequence as represented in SEQ ID NO 5, and, 

(k) nucleic acid encoding a protein as defined in SEQ ID NO 4, SEQ ID NO 19 or 
30 nucleic acid as defined in any one of (a) to (h), said nucleic acid interrupted by 

intervening DNA sequences. 
The sequence as depicted in SEQ ID NO 1 is part of a sequence deposited under the 
Genbank Accession number AB017061, which represents the entire Arabidopsis 
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thaiiana chromosome 5. The nucleic acid of the invention Is not the nucleic acid with the 
sequence as deposited under the Genbank Accession number ABQ1 7061 (Figure 14). 
Furthermore, in a preferred embodiment, the present invention reiates to an isolated 
nucleic acid compnsing a novel root-specific promoter sequence, termed RCHIprom, 

5 isolated from Arabidopsis thaiiana and consisting of the nucleic acid as represented in 
SEQ ID NO 2 or SEQ ID NO 18, or a functional part of said sequence, which is able to 
regulate gene expression in a root specific manner as SEQ ID NO 2 or SEQ ID NO 18 
itself. More specifically, said isolated DNA sequence provides a novel type of a root- 
specific promoter, which differs from other known root-specific promoters in the fact that 

10 it is unexpectedly active in only distinct cells of the root, namely the meristem of the main 
and the lateral root, as well as in the vascular tissue of the main and lateral root and in 
the lateral root primordia. 

The isolation of the root specific promoter RCH1 of the present invention is described in 
Example 1 . 

15 Another aspect of the invention is the isolation and sequencing of a root-specific Leucin 
Rich Repeat (LRR) receptor-like kinase gene from root tissue of Arabidopsis thaiiana 
and is also described in Example 1. The nucleotide sequence is listed in the current 

specification as SEQ ID NO 3 and illustrated in Figure 8. This sequence is present as 
part of the sequence deposited under the GenBank Accession number AB017061 , which 

20 represents the entire Arabidopsis thaiiana chromosome 5. 

The novelty of the RCH1 promoter sequence relative to known root-specific promoters is 
substantiated by its unique expression pattern in plant roots. The analysis of the RCH1 
expression patterns was done using a RCHIprom-GUS hybrid operon that was 
transformed to Arabidopsis thaiiana as outlined in Example 2. Analogous experiments 

25 with the RCHIprom-GUS hybrid operon were performed to establish the expression 
pattern of RCH1 in embryo's (Example 8 and Figure 15). 

The RCH1 promoter is active in the meristem of the main and the lateral roots, as well as 
in the vascular tissue of the main and the lateral root and in the lateral root primordia. 
The promoter of the present invention is the first known promoter that regulates a 
30 C LAV ATA HO MO LOG gene in roots, and is clearly distinct from other described 
CLA VA T/A-operons which are all active in the shoot and floral meristem. The present 
data thus cleariy distinguish RCHIprom from other plant root-specific promoters, which 
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are not expressed in the meristem of the main and lateral root, in the vascular tissue of 
the main and lateral root and in the laterai root primordia. 

From the results obtained by the RCH1;:GFP expression studies in root it was even 
more clear that the.RCHI promoter is strongly active in the endodermis, cortex, 

5 epidermis, lateral root cap, in short In the "division zone". The RCH1 promoter is also 
active but in a lower manner, in the quiescent center and in the vascular tissue. 
A further characteristic of the RCH1 promoter is its very high expression level. Based on 
the incubation time of the RCH1prom-GUS hybrid operon transformed plant with the 
GUS substrate (1 hour) and the very high output signal in the root of that plant (Example 

10 2, Figure 7D), it is clear for persons skilled in the art that the RCH1 promoter is very 
strong. 

A further characteristic of the present invention is that the activity of the RCH1 promoter 
in the meristem ceiSs stops when these cells start to differentiate and to elongate to 
become cefis of specified root tissue other than root vascuiar tissue. Surprisingly the 
15 promoter is activated again in cells of the pericycSe that form lateral root prirnordia. This 
expression is extended in the meristem of the lateral root, but again, the cells that 
differentiate into lateral root tissue other than vascular tissue do not retain RGH1 
promoter activity. 

Functional parts of the promoter of the invention will have at least one of these 
20 characteristics. 

According to a further embodiment, the present invention relates to an isolated nucleic 
acid comprising a transcriptional regulatory root promoter comprising at least part of a 
sequence as given in SEQ ID N01 , 2 or 1 8. 

According to another embodiment, the present invention relates to an isolated nucleic 
25 acid as represented In SEQ ID NO 2 or 18 wherein said transcriptional regulatory root 
promoter is operably linked to said nucleic acid sequence encoding a novel LRR 
receptor-tike kinase or to an immunologically active and/or functional fragment thereof. 
According to the invention, distinctive fragments of the newly identified promoter, which 
contribute specific features to said promoter, ■ can be fused to another nucleic acid 
30 sequence (e.g. a promoter) in order to alter the sequential and/or regulatory features of 
the latter. 

The invention thus relates to an isolated nucleic acid having transcriptional regulatory 
root specific, root-meristem specific or root-vascular-tissue specific promoter activity 
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comprising at least part of the DNA sequence as given in any of SEQ ID 1, 2 or 18 and a 

second transcriptional regulatory sequence. 

In one embodiment, said second trancriptionai regulatory sequence is for instance an 
inducible box or sequence, which is placed xvithin the RCH1 promoter, wherein the 
5 function of the RCH1 pron^oter is not hampered but instead can be regulated, for 
instance by induction. In the latter embodiment, the RCH1 promoter is not used to confer 
root-specificity to other promoters, but a second sequence is used to confer an 
additional regulatory feature to the RCH1 promoter. 

In another embodiment said second transcriptional regulatory sequence is a promoter 
10 sequence not normally exhibiting root-specificity, thereby resulting in a hybrid promoter 
with root-specificity. 

Accordingly, another embodiment of the present invention is to confer root-specificity, 
root-meristem-specificity, and root-vascular-tissue-specificity and/or root abundant gene 
expression to other promoter sequences. This can be achieved by fusing elements of 
15 the here-disclosed RCH1 promoter to the promoter sequence of interest preferably the 
widely used ubiquitin promoter. Such modifications can be achieved by routine 
experimentation by those skilled in the art. 

It should be further understood that the invention preferably relates to any of the isolated 
nucleic acids as described above which is DNA, cDNA, genomic DNA, synthetic DNA or 

20 RNA wherein T is replaced by U. 

The invention also relates to a nucleic acid moiecule of at least 1 5 nucleotides in length 
hybridizing specificaiiy with any of the nucieic acids of the invention. The invention also 
relates to a nucleic acid molecule of at least 15 nucleotides in length specifically 
amplifying a nucleic acid of the invention. 

25 Further, the invention relates to a vector comprising a nucleic acid of the invention. Said 
vector may be an expression vector wherein the (said) nucleic add sequence encoding a 
novel LRR receptor-like kinase or an immunologically active and/or functional fragment 
thereof, is operably linked to one or more control sequences allowing the expression in 
prokaryotic and/or eukaryotic host cells. 

30 The invention also relates to a vector comprising at least part of a nucleic acid according 
to the invention and wherein a transcriptional regulatory root promoter of the invention or 
functional part(s) thereof, is operably linked to one or more genes of interest. 
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The Invention also relates to a host cell containing a nucleic acid molecule of the 
invention or a vector of the Invention wherein said nucleic acid or vector has been 
introduced by transfomation, transfection or infection. Preferred host cells that are part of 
the invention are bacterial, Insect, fungal, plant or animal cells. 

5 Therefore, in a preferred embodiment, the invention relates to an isolated nucleic acid as 
represented in SEQ ID NO 3 encoding a LRR receptor-like kinase characterized by an 
amino acid sequence as giveri In SEQ ID NO 4 or 19. More specifically, said isolated 
nucleic acid encodes a plant ROOT CLAVATA HOMOLOG 1 (RCH1) of a novel type 
which is unexpectedly expressed In roots. Said novel plant RCH1 shows significant 

10 homology with CLAVATA 1 , a protein involved in the maintenance of floral and shoot 
meristem cells. 

An initial comparative amino acid sequence homology search using the BLASTP 2.0.8 
software {Altschui et ai. 1997) revealed that most significant alignments of RCH1 protein 
are produced with receptor-iike protein kinases, more specifically with leucine rich 
15 receptor-like kinases such as CLAVATA1 (CLV1). Furthermore, the most homoiogous 
protein with a known function of this list, is the CLAVATA 1 receptor-like kinase from 
Arabidopsis thaliana. Therefore the novel gene was denominated a ROOT CLAVATA 
HOMOLOG 1 gene(RCHI). 

An amino acid sequence alignment of the RCH1 protein and CLAVATA 1 from A. 
20 thaliana is shown in Figure 1 0. Calculation of the percentage of identical amino acids 
residues between RCH1 and CLV1 teaches that RCH1 is 39% identical to CLV1. 
Additional to 39% of identical amino acids, there is 49% of the amino acids that belong 
to the same class of amino acids and therefore have the same physical and chemical 
properties. 

25 The classification of RCH1 as a novel type of CLAVATA is documented further in figures 
11 and 12. 

The conclusions we can draw from figure 11 is that based on the sequence, RCH1 is 
closer related to CLV1 compared to other known CLV1 homoiogues, such as ERECTA, 
BRASSINOLIDE INSENSITIVE and HAESA. 
30 The fact that the novel LRR receptor like protein described in the present invention is 
rather a root ciavata 1 hom.ologue then another kind of LRR receptor-like kinase (e.g. 
resistance genes) is illustrated in Figure 12 in which the evolutionan/ relationship 
between ciavata homoiogues and resistance genes is shown. The resistance genes 
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(RPS5, RPS2, RPP8, RPM1, RPS4, N, RPP5) are described in "Meyers BC, Dickerman 
AW, Michelmore RW, Sivaramakrishnan S, Sobrai BW, Young ND. Plant disease 
resistance genes encode members of an ancient and diverse protein family within the 
nucleotide-binding super family. Plant J, 1 999 Nov;20(3):317-32". 
5 The present invention relates to a ROOT CLAVATA HOMOLOG 1 gene that differs from 
other known CM\MTA-like genes, In the fact that it is active in the root meristem, 
whereas the other known CLAVATA genes and CLAVATA homologs (like ERECTA, 
BRASSINOLIDE INSENSITIVE and HAESA) are only active in the shoot and floral 
meristem. 

10 According to another embodiment, the invention also relates to nucleic acids encoding a 
functional plant LRR receptor-like kinase comprising one or more protein regions, 
distinguishing said plant LRR receptor-like kinase from those LRR receptor-like kinases 
known in the art. 

The inventors found that the predicted amino acid sequence (BAB10317.1, receptor 
15 protein kinase-like protein) corresponding to the database entry with accession number 

AB017061 (A. thalinana chromosome 5), was missing a part of the amino acid sequence 
(namely, 25 amino acids as represented in BED ID NO. 5 of the present patent 
application). This extra stretch of 25 amino acids was identified by the cioning and the 
sequencing of the corresponding cDNA of the present invention. Therefore, the 
20 predicted prior art sequence is believed not to represent a functional enzyme. 

The invention thus relates to an isolated nucleic acid selected from the group consisting 
of: 

(a) a nucleic acid consisting of at least part, preferably at least a functional part, of 
the DNA sequence as given in SEQ ID NO 1 or 3, or the complement thereof, 

25 (b) a nucleic acid encoding a protein as given in SEQ ID NO 4 or 19 or encoding a 

fragment of said protein, wherein said fragment comprises the sequence as 
represented in SEQ ID NO 5, and 

(c) a nucleic acid encoding a protein with an amino acid sequence which is at least 
60%, preferably at least 65 %, 70 %, 75%, 80%, 85%, more preferably at least 
30 90% or 95% identical to the protein as given in SEQ ID NO 4 or 9, wherein said 

amino acid sequence comprises the sequence as represented in SEQ ID NO 5, 
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characterised in that said nucleic acid encodes a novel LRR receptor-like kinase protein 
or an immunologically active and/or functional fragment of such a protein, and further 
provided that said nucleic acid is not one of the nucleic acids as deposited under the 
GenBank Accession numbers AB017061 or AQ966419. 
5 The present Invention also relates to an isolated LRR receptor-like kinase (protein) 
comprising one of the polypeptides selected from the group consisting of: 

(a) a polypeptide as given in SEQ ID NO 4 or SEQ ID NO 19, 

(b) a polypeptide comprising the amino acid sequence as given in SEQ ID NO 5, 
and, 

10 (c) a polypeptide comprising the sequence represented in SEQ ID NO 5 and 

encoded by a nucleic acid as given in SEQ ID NO 1 or 3, 

or a homologue or a derivative of said protein, or a fragment or an immunologically 
active and/or functional fragment thereof, provided that said homologue is not the amino 
acid sequence as described under the GenBank Accession number BAB10317.1 . 
15 With the expression "homologue" when refemng to homologues of the LRR receptor-like 
. kinase (protein), a polypeptide is meant which is at least 65% identical to the amino acid 
sequence of RCH1 as represented in SEQ ID NO 4 or 19. 

More preferred, the invention relates to a protein consisting of an amino acid sequence 
as given in SEQ ID NO 4 or SEQ ID NO 19. More specifically, the present invention 
20 relates to a root CLAVATA homologue or a functional homologue thereof. A preferred 
fragment of said protein is represented in SEQ ID NO 5. 

A further embodiment of the invention comprises homologues, derivatives, 
immunologically active and/or functional fragments of the LRR receptor-like kinase 
(protein) according to the invention, and proteins comprising said homologues, 
25 derivatives, immunologically active and/or functional fragments of said LRR receptor like 

kinase (protein). 

As such, the present invention also relates to an isolated polypeptide comprising the 
sequence represented in SEQ !D NO 5 encodable by a nucleic acid molecule of the 
invention as defined above, or a homologue or a derivative thereof, or a fragment or an 
30 immunologically active and/or functional fragment thereof. 

In a more preferred embodiment, the invention relates to a polypeptide, encodable by a 
nucleic acid molecule of the invention and which has an amino acid sequence as given 
in SEQ ID NO 4 or SEQ ID NO 19 or comprising an amino acid sequence as given in 
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SEQ ID NO 5, or a homoiogue or a derivative thereof, or an immunologically active 
and or functional fragment thereof. 

Any of said proteins can be produced in a biological system, e.g. a cell culture. 

Airornatively any of said proteins is chemically manufactured e.g. by solid phase peptide 
5 synthesis. Said proteins or fragments thereof can be part of a fusion protein as is the 

case in e.g. a two-hybrid assay which enables e.g. the identification of proteins 

inier acting with the LRR receptor-like kinase according to the present invention. 

Therefore, according to another embodiment, the Invention also relates to a method for 

pfcxAjcing a polypeptide or a protein of the invention comprising culturing a host cell 
10 fanher specified above under conditions allowing the expression of the polypeptide and 

recovering the produced polypeptide from the culture. 

The pfotesns or fragments thereof obtained by a method of the invention are furthermore 
u'ioM e.g. to modulate the interaction between a LRR receptor-!ike kinase according to 
Shtf invention and a ligand of the receptor and/or other identified interacting protein 
15 pixftTHiss Chernicaliy synthesized peptides are particuiarSy usefui e.g. as a source of 
antKjons for the production of antisera and/or antibodies. 

Tho current invention thus furthermore encompasses antisera and/or antibodies 
specif icaliy recognizing the LRR receptor- like kinase according to the invention or 
(mmufxjlogically active parts or epitopes thereof. Said antisera and/or antibodies are 
:o useful m many areas related to the invention including: (1) identification fn any organism, 
pfdofably plants, of other LRR receptor-like kinase and their genes according to the 
invoot>on; (ii) quantification of synthesis in organisms and/or recombinant organisms of 
tne LRR receptor-like kinase according to the invention; (ill) purification of the LRR 
rocoptof-like kinase according to the invention; (iv) immunoprecipitation of the LRR 
recoptof-like kinase according to the invention e.g. as a way to identify other protein 
^viftnofs complexing with said LRR receptor-like kinase; (v) Immunolocalization of the 
LRR receptor-like kinase according to the invention which is expressed in an organism 
ot a recombinant organism. 

Tho invonlion also relates to a method for the production of transgenic plants, plant cells 
30 or p\an\ tissues comprising the introduction of a nucleic acid molecule of the invention in 
an expressible format or a vector as described earlier in said plant, plant cell or plant 

tiSSUO 
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The invention also relates to a method as described above further comprising 
regenerating a plant from said plant celi. 

In a further embodiment, the invention also relates to a transgenic plant ceil comprising a 
nucleic acid sequence of the invention which is operably linked to regulatory elements 

5 allowing transcription and/or expression of said nucleic acid in plant cells or obtainable 
by one of the methods described above. Preferably, in said transgenic plant cell one of 
the nucleic acids of the invention is stably integrated into the genome of said plant cell. 
The invention further relates to a transgenic plant or plant tissue comprising said plant 
cells and to a harvestable part of said plant or tissue. Said harvestable part of said plant 

10 or tissue is preferably selected from the group consisting of seeds, leaves, fruits, stem 
cultures, rhizomes, tubers and bulbs. 

The invention further extends to the progeny derived from any of the plants or plant parts 

described above. 

The present invention is applicable to any plant, in particular a monocotyiedonous plants 

15 and dicotyledonous plants including a fodder or forage legume, ornamental plant, food 
crop, tree, or shrub selected trom the list comprising Acacia spp., Acer spp., Aotinidia 
spp.,Aesculus spp., Agathis australis, Atbizia amara, Alsophila tricolor, Anciropogon spp., 
Arachis spp, Areca catechu, Astelia fragrans, Astragalus cicer, Baikiaea plurijuga, Betula 
spp., Brassica spp., Bruguiera gymnorrhiza, Burkea africana, Buiea frondosa, Cadaba 

20 farinosa, Calliandra spp. Camellia sinensis, Carina indica, Capsicum spp., Cassia spp., 
Centroema pubescens, Chaenomeles spp., Cinnamomum cassia, Coffea arabica, 
Coiophospermum mopane, Coronillia varia, Cotoneaster serotina, Crataegus spp., 
Cucumis spp., Cupressus spp., Cyathea dealbata, Cydonia oblonga, Cryptomeria 
japonica, Cymbopogon spp., Cynthea dealbata, Cydonia oblonga, Dalbergia monetaria, 

25 Davallia divaricata, Desmodium spp., Dicksonia squarosa, Diheteropogon amplectens, 
Dioclea spp, Dolichos spp., Dorycnium rectum, Echinochloa pyramldalls, Ehrartia spp., 
Eleusine coracana, Eragrestis spp., Erythrina spp.. Eucalyptus spp., Euclea schimperi, 
Eulalia villosa, Fagopyrum spp., Feljoa sellowiana, Fragaria spp., Flemingia spp, 
Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycine javanica, Gliricidia spp, 

30 Gossypium hirsutum, Grevsllea spp., Guibourtia coleosperma, Hedysarum spp., 
Hemarihia alti'ssima, Heteropogon contortus, Hordeum vulgare, Hyparrhenia rufa, 
Hypericum erectum, Hyperttielia dissoluta. Indigo incarnata, Iris spp., Leptarrhena 
pyrolifolia, Lespediza spp., Lettuca spp., Leucaena leucocephala, Loudetia simplex. 
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Lotonus bainesii, Lotus spp., Macrotyloma axillare, Malus spp,, Manihot esculenta, 
Medlcago satlva, Metasequoia glyptostroboldes, Musa sapientum, Nicotianum spp., 
Onobrychis spp., Ornithopus spp., Oryza spp., Peltophorum africanum, Pennisetum 
spp., Persea gratissima. Petunia spp., Phaseolus spp., Phoenix canadensis, Phormium 

5 cool<ianum, Pfiotinia spp., Picea glauca, Pinus spp., Pisum sativum, Podocarpus totara, 
PogonarthrJa flecl<ii, Pogonartlina squarrosa, Populus spp., Prosopis cineraria, 
Pseudotsuga menziesii, Pterolobium steilatum, Pyrus communis, Quercus spp., 
Rhaphiolepsis umbellata, Rhopalostylis sapida, Rhus natalensis, Ribes grossulan'a, 
Ribes spp., Robinia pseudoacacia, Rosa spp., Rubus spp., Saiix spp,, Schyzacliyrium 

10 sanguineum, Sciadopitys verticillata. Sequoia sempervirens, Sequoiadendron 
gigantBum, Sorghum bicolor, Spinacia spp., Sporobolus fimbriatus, Stiburus 
alopecuroides, Stylosantlios humilis, Tadehagi spp, Taxodium disilchum, Themeda 
triandra, Tnfolium spp., Tritlcum spp., Tsuga heterophyila, Vaccinium spp., Vicia 
spp. Vitis vinifera, Watsonia pyramidata, Zantedeschia aethiopica, Zea mays, amaranth, 

15 artichoke, asparagus, broccoli, brussel sprout, cabbage, canoia, carrot, cauiif lower, 
ceiery, coOard greens, flax, kale, leriti!, oliseed rape, okra, onion, potato, rice, soybean, 
straw, sugarbeet, sugar cane, sunflower, tomato, squash, and tea, amongst others, or 
the seeds of any plant specifically named above or a tissue, cell or organ culture of any 
of the above species. 

20 Yet according to another embodiment the present invention relates to a method for 
conferring root-specificity and/or root-meristem-specificlty and/or root-vascular-tissue- 
specificity and/or root-endodermis-specificity and/or root-cortex-specificity and/or root- 
epidermis-specificity and/or lateral-root cap-specificity and/or abundant expression in 
roots to other promoter sequences comprising the fusion of at least part of the DNA 

25 sequence as given in SEQ ID NO 1, 2 or 18 to a (second) transcriptional regulatory 
promoter sequence normally not exhibiting root-specificity. . 

The invention further relates to method for root-specific expression of a gene(s) of 
interest comprising operably linking of said gene(s) of interest to a transcriptional 

regulatory root-specific promoter comprising at least part of the nucleic acids as given in 
30 SEQ ID NO 1, 2 or 18 as defined above, and possibly in combination/or a (second) 
transcriptional regulatory promoter sequence not normally exhibiting root-specificity, as 
defined earlier. 
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Tho invention further relates to a method for modifying cell fate and/or plant 
dcvcJopment and/or plant morphology and/or biochemistry and/or physiology comprising 
the modification of expression in the meristem of the main and the lateral roots, or in the 
vascular tissue of the root, or in the lateral root primordia of a gene(s) of interest 
op€fably linked to a transcriptional regulatory root promoter as defined earlier. 
The ability to drive tissue-specific expression of any gene of interest in plants is 
considered to be of great agronomic importance. This art can be used to exclude 
expression in eatable parts of the plant, which averts the uptake of ectopic components. 
AiTomaiiveiy the expression of the transgene can be restricted to the relevant tissues of 
!h<? plant so the total organism does not spend too much energy in superfluous 
prcxjjcton of the foreign component. This technique can also reduce potential yield loss 
by umfting the expression of some pernicious, yet usefui agronomic genes, to specific 
ttssues only. Said genes of interest, which can be expressed in a root-specific manner 
undef the control of the present invention can also be involved in phytoremediation, 
cfnKinced root-growth (Hke cell cycle genes or cytokinin oxidase) or disease tolerance. 
These methods are more elaborated in detail beiow. 

The invention described here provides a new transcriptional regulatory element which 
can be operably linked to a gene effecting and/or modifying root metabolites which can 
be involved in phytoremediation. Examples are genes involved in the production of citric 
and maleic acids that can be secreted by the plant and subsequently form complexes 
wtth the aluminium in the soil. Root specific expression of the aluminium-resistance gene 
ttvat give rise to said metabolites can therefore allow plants to grow on acid soils, like 
those of south-eastern United States, Central and south America, North Africa and parts 
ol Indta and China, where the aluminium is set free as an ion that poisons plant roots. 
Alternatively, the promoter of the present invention also provides a tool to be used in a 
strategy for environmental remediation. Accordingly, a metal-resistance gene such as 
ihe TaPCSf or the CAD1 gene, contributing cadmium tolerance, can be expressed 
s|>ecsf!ca[!y in roots to enable enhanced growth of transgenic plants on soils 
coriiaminated with cadmium. The metals can consequently be extracted from the soil 
and stored in the plant material, which could then be harvested and incinerated. 
ThK?fL'fore according to preferred embodiments, the present invention relates to a 
method for phytoremidiation or environmental remediation comprising the expression of 
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a gene(s) of interest under the control of a transcriptional regulatory root promoter as 

defined earlier. 

In conclusion tlie present invention provides a metiiod for broadening the range of soils 
or environments in which a plant may thrive or that can be used for agriculture. 

5 Plant root development is an essential determinant of plant growth and crop yield, so 
that yield could be enhanced by induced changes in expression of root-specific 
regulatory factors. Cyclin D for example, which promotes the cell division, can be 
expressed under the root-specific control of the current invention and hence contribute to 
increased growth rate and crop yield. Also genes that are involved in biosynthesis and 

10 perception of phytohormones like auxins, cytokinines, ethylene, abscisic acid and 
gibebereilins can be used to enhance root growth and development. However, the root- 
specific expression of these genes might be pivotal for the overall welfare of the 
transgenic plant. 

It is known that the constitutive and ubiquitous expression of the cytokinine oxidase in 
15 Arabidopsis thaliana leads to the promotion of an enlarged root system and enhanced 
root biomass, but also to diminished shoot grovyth. To overcome this problem the 
cytokinin oxidase can be operably linked to the promoter of the present invention as 
described in example 6, which will enhance root growth without negatively affecting 
shoot gro\,vth. This enhanced root growth in the piant can be used for bioremidiation or 
20 phyloremediation. 

Also the said gene of interest, that can be operabiy linked to the promoter of the present 
invention, can be a root transcription factor, such as Aifin. Alfin binds to promoter 
fragments of predominantly root-specific and salt inducible genes and its activity has 
major implications for crop plant yield (Winicov 2000) (Winicov & Bastola 1999)). In 
25 conclusion the present invention provides a method for enhancing plant growth and crop 
yield. 

It is also clear for the skilled artisan that the promoter of the present invention provides a 
useful tool to be used for the Introduction of resistance to soil-borne pathogen attack to 
plants, a widespread target for genetic improvement of crop plants. Certain disease- 
30 inducing microorganisms attack the belbwground plant tissue and any genetic 
modification to contribute resistance to such organisms will require expression of the 
resistance-gene in the roots. To overcome this problem, the coding region that Is 
operably linked to the promoter of the present invention preferably encodes a protein 
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which is toxic to root-attaclcing organisms- and more preferably the protein is an 
insecticidal endotoxin of Bacillus thuringiensis. Similarly to the above-mentioned 
pathogens, there are pests attacking plants. Certain insect species attacl< green leaf 
tissue, whereas other, for example coleoptera, attack the roots. Corn root worni 

5 {Diabrotica undecimpunctata howardi Barbei) for example is a particularly difficult pest to 
control or to eradicate. It attacks the plant below the soil line, where insecticides are 
difficult or impossible to apply effectively. Com root worm can be eradicated by root- 
- specific expression of the Cry III gene, which produces a component, specifically toxic 
for coleoptera. Also root-specific expression of the limonene cyclase gene in 

10 combination with GPPP synthetase would be lar\'icidai. Another pest, nematodes, that 
penetrate the root and form root-knots or cysts, which cause severe damage to crops 
throughout the world, couid be treated with the root-specific expression of nematode 
resistance genes such as Hs1°^^"\ Ni-1 and Gpa2. 

in conclusion the invention disclosed here provides a method for conierring enhanced 
15 resistance to pathogens to a transgenic plant which pathogens attack the belowground 
plant tissue comprising the expression of a gene(s) of interest under the control of a 
transcriptional regulatory root-specific promoter as defined earlier. 
In yet another embodiment of the invention, the invention provides a method to 
enhanced freezing tolerance comprising the expression of a gene(s) of interest under 
20 the control of a transcriptional regulatory root-specific promoter as defined earlier. 
Preferred is the overexpressing of the Arabidopsis thaliana DNA binding factors CBF1 
and DREB1A that Induce cold-regulated (COR) genes (Jaglo-Ottosen et al, 1998). 
However, constitutive expression of these genes in all organs of the plant led to growth 
retardation under normal growth conditions. Root-specific expression of these freezing- 
25 tolerance genes under control of the promoter sequence of the present invention can 
make the plant tolerant to cold temperatures of the soil without affecting the growth of 
the plant. This technique can have important agronomic potential since lots of soils are 
inaccessible for crop production because of presence of permanently frozen layers in the 
soil. 

30 Alternatively, other genes of interest can toierize the transgenic plant for other weather 
conditions like drought (e.g. oat arginin decarboxylase gene) and high salt conditions 
(e.g. MtlD gene), in conclusion, the present Invention provides a method to contribute to 
a transgenic plant stress-tolerance. 
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Further advantages to root-preferred gene expression include the production of useful 
proteins in an industrial setting. Ligni-sensirive proteins may be synthesized in root 
tissue such that said proteins are not exposed to light. Therefore the present invention 
also relates to a method for the production of light-sensitive proteins comprising the 
5 expression of a gene encoding said light-sensitive protein under the control of a 
transcriptional regulatory root-specific promoter of the invention. 
Also considered as an embodiment of the invention is the use of the RCH1 promoter for 
applications based on the inhibition of expression of native DNA sequences within the 
plant's root to achieve a desired phenotype resulting from the silencing of the native 
10 gene, in this case, such inhibition might be accomplished with transformation of the plant 
to comprise a transcriptional regulatory root-specific promoter of the present invention 
operably linked to e.g. an antisense nucleotide sequence to the gene of interest, a gene 
silencing construct or a ribozyme. 

Those skilled in the art will be aware that plants secure the formation of organs 
15 throughout their life span by developing and maintaining a coiiection of stem cells 
termed the meristem. The promoter of the present invention offers the opportunity to 
deliver a certain protein specifically to these cells. For example overexpression of genes 
ttiat influence the proiiferation and/or the differentiation of the root meristem could 
contribute to a higher turnover of the meristem ceils and therefor enhance the root 
20 architecture. 

The promoter of the present Invention drives the Arabidopsis thaliana gene ROOT 
CLAVATA HOMOLOG 1 (RCH1). Since the c/avafa-genes are key role players in shoot- 
meristem maintenance, persons skilled in the art may recognize the possible 
involvement of the RCH1 operon in root meristem maintenance. Therefor one could 
25 assume that the control of this promoter is influenced by signals that promote meristem 
maintenance. This might be an advantage when aiming to stimulate the root meristem- 
formation, which eventually could result in more roots, higher uptake of resources and in 
higher crop yield. 

Therefore, according to preferred embodiments, the invention relates to methods for 
30 stimulating meristem formation and/or maintaining root meristem comprising the 
expression of a gene that influences the proliferation and/or differentiation of the root 
mersiteme under the control of a transcriptional regulatory root (-specific) promoter of the 
invention. 
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The present invention therefore relates to a metlnod for stimulating root nneristem 
formation or for root-meristeme maintenance comprising the expression in particular 
cells, tissues or organs of a plant, of a nucleic acid encoding an LRR receptor-lil<e 
kinase, for instance the LRR receptor-like kinase of the invention, wherein optionally said 
5 nucleic acid is operably linked to a plant-operable promoter sequence, for instance said 
plant-operable promoter being the transcriptional regulatory root promoter as defined 
earlier. 

As mentioned supra, the combined gene expression data and homology to CLAVATA 1 
indicate that the novel plant LRR receptor-tike kinase of the invention, RCH1 (SEQ ID 

10 Ho. 3, 4, 1 9), is very likely to exert a function that is involved in the development and/or 
maintenance of the root meristem. As the root \s the main plant organ for nutrient uptake 
from the environment, enhanced root formation could eventually lead to overall growth 
enhancement and better yield. Therefore preferred genes to be expressed in the 
methods for stimulating meristem formation and/or for root meristeme maintenance are 

IS genes encoding the polypeptide(s) of the invention. 

' The present invention also relates to a method for enhancing root formation and/or root 
growth comprising the expression in particular ceils, tissues or organs of a plant, of a 
nucleic acid encoding an LRR receptor-like kinase, for instance the LRR receptor-like 

kinase of the invention, wherein optionally said nucleic acid is operably linked to a plant- 
20 operable promoter sequence, for instance said piant-operable promoter being the 
transcriptional regulatory root promoter as defined earlier. 

The present invention therefore relates to a method for enhancing overall growth and 
yield comprising the expression in particular cells, tissues or organs of a plant, of a 
nucleic acid encoding an LRR receptor-like kinase, for instance the LRR receptor-like 
25 kinase of the invention, wherein optionally said nucleic acid is operably linked to a plant- 
operable promoter sequence, for instance said plant-operable promoter being the 
transcriptional regulatory root promoter as defined earlier. 

The present invention also relates to a method for enhancing overall growth and yield 
comprising the expression of a protein of the invention under the control of a 
30 transcriptional regulatory root (or root-specific) promoter of the invention. 

Also In yet other embodiments of the invention, methods are provided for modifying cell 
fate and/or plant development and/or plant morphology and/or biochemistry and/or 
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physiology comprising the modification of expression in particiiiar cells, tissues or organs 
of a plant, of a nucleic acid encoding an LRR receptor-like kinase, for instance the LRR 
receptor-like kinase of the invention and defined above, wherein optionally said nucleic 
acid is operably linked to a plant-operable promoter sequence, for instance said piant- 
5 operable promoter being the transcriptional regulatory root promoter as defined earlier. 

The present Invention further relates to a method to confer pathogen resistance to a 
transgenic plant comprising the expression in particular cells, tissues or organs of a 
plant, of a nucleic acid encoding an LRR receptor-like kinase, for instance the LRR 
receptor-like kinase of the invention and defined above, wherein optionally said nucleic 

10 acid is operably linked to a plant-operable promoter sequence, for instance said plant- 
operable promoter being the transcriptional regulatory root promoter as defined earlier. 
Many pathogen resistance genes are LRR receptor-like proteins, which initiate a signal 
transduction pathway in the root cells, leading to the activation of defense mechanisms 
against the pathogen. The nove! LRR receptor-like kinase might contribute to pathogen 

15 resistance to the transgenic plant when transformed herein. 

Yet another embodiment of the invention relates to methods to perform a functional 
analysis of the DNA sequence according to the present invention, encoding a novel LRR 
receptor-like protein. For persons skilled in the art it is possible to make single or double 
knock-out mutants of Arabidopsis thallana, wherein the DNA of the present invention is 

20 inactivated. Also introduction of the antlsense sequence of the DNA of the invention, as 
well as the overexpression of its sense sequence in a transgenic plant can elucidate a 
specific feature in this transgenic plant. Furthermore, the loss of function of this gene of 
the invention by introduction of dominant negative mutations can result in interesting 
characteristics of the mutant plant. 

25 Another aspect of the present invention is the revelation of a new target for herbicides or 
growth regulators. When said gene of the invention is an essential gene for plant cell 
viability, it can be a target for a herbicide or growth regulator. 

Therefore the invention relates to a method for identifying and obtaining proteins 
interacting with a (LRR receptor-like kinase) polypeptide of the invention comprising a 

30 screening assay wherein said polypeptide is used or expressed. 

In a preferred embodiment the invention relates to a method for identifying and obtaining 
proteins interacting with a an LRR receptor-like kinase (protein) comprising a two-hybrid 
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screening system wherein a nucleic acid encoding a polypeptide of the invention as a 
bait and a cDNA library as prey are expressed. 

The invention also relates to a method for modulating the interaction between an LRR 
receptor-like kinase (protein) of the invention and interacting proteins obtainable by a 
5 method as described above. 

The invention further relates to a method for identifying and obtaining compounds 
interacting with an LRR receptor-like kinase (protein) comprising the steps of: 

a) providing a two-hybrid screening system wherein a polypeptide or protein of 
the invention and a protein interacting with said (LRR receptor-like kinase) 

10 poiypeptide or protein or an interacting protein obtainable by a method of 

claim as claimed above are expressed, 

b) interacting said compound with the complex formed by the expressed proteins 
as defined in a), 

c) detecting a second complex, wherein the presence of said second complex 
15 identifies a compound which specifically binds to one of said polypeptides or 

said second complex, and 

d) identifying the compound. 

The invention also relates to a method for identifying compounds or mixtures of 
compounds which specifically bind to a polypeptide of the invention, comprising: 

20 a) combining a poiypeptide of the invention with said compound or mixtures of 

compounds under conditions suitable to allow complex formation, and, 
b) detecting complex formation, wherein the presence of a complex identifies a 
molecule which specifically binds said polypeptide. 
The invention further relates to the use of a molecule identified by means of one of the 

25 methods described above as a plant growth regulator or herbicide. 

The invention also relates to a method for the production of a plant growth regulator or 
herbicide composition comprising the steps of one of the methods described above and 
formulating the compounds obtained from said steps in a suitable form for the application 
in agriculture or plant ceil or tissue culture. 

30 The invention also relates to the use of any of the nucleic acid moiecules, vectors, 
polypeptides or antibodies of described herein for modifying ceil fate and/or plant 
development and/or plant morphology and/or plant biochemistry and/or plant physiology. 



wo 02/46439 PCT/EPO 1/14 154 



The invention also relates to a diagnostic composition comprising at least one of the 
nucleic acid molecules, vectors, polypeptides or antibodies of the invention. 
It is further important to notice that CLAVATA 1 has nriany analogues in other plant 
species, so it is likely that also the promoter of RCH1, the RCH1 ORF or the complete 
5 RCH1 operon can be functional in those other plant species, and possibly exert the 
same specific activation pattern. This can be investigated by root-specific reporter gene 
expression in rice mediated by the Arabidopsis thaliana RCH1 promoter as described in 
example 5. 

Definitions and elaborations to the embodiments 

10 The terms "protein{s)", "peptide(s)" or "oligopeptide(s)", when used herein refer to amino 
acids in a polymeric form of any length. Said terms also include known amino acid 
modifications such as disulphide bond formation, cysteinylation, oxidation, 

glutathionyiation, methyiation, acetyiation, farnesylation, biotinylation, stearoylation, 
formyiation, lipoic acid addition, phosphorylation, sulphation, ubiquitination, 

15 myristoyiation, pafmitoyiation, geranylgeranylation, cyclization (e.g. pyrogiutamic acid 
formation), oxidation, deamidation, dehydration, giycosyiation (e.g. pentoses, 
hexosamlnes, N-acetylhexosamines, deoxyhexoses, hexoses, sialic acid etc.), acylation 
and radioiabeis (e.g. ^^^i, ^^^!, ^^S, ^''C, ^^P, '''^P, ^H) as well as non-naturaiiy occurring 
amino acid residues, L-amino acid residues and D-amino acid residues. 

20 "Homologues" or "Homoiogs" of a protein of the invention are those peptides, 
oligopeptides, polypeptides, proteins and enzymes which contain amino acid 
substitutions, deletions and/or additions relative to the said protein with respect to which 
they are a homolog, without altering one or more of its functional properties, in particular 
without reducing the activity of the resulting. For example, a homolog of said protein will 

25 consist of a bioactive amino acid sequence variant of said protein. To produce such 
homoiogs, amino acids present in the said protein can be replaced by other amino acids 
having similar properties, for example hydrophobicity, hydrophilicity, hydrophobic 
moment, antigenicity, propensity to form or break a-helical structures or p-sheet 
structures, and so on. An overview of physical and chemical properties of amino acids is 

30 gjven in Table 1 . 

Substitutional variants of a protein of the invention are those in which at least one 
residue in said protein amino acid sequence has been removed and a different residue 
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inserted in its place. Amino acid substitutions are typically of single residues, but may be 
clustered depending upon functional constraints placed upon the polypeptide; insertions 
will usually be of the order of about 1-10 amino acid residues, and deletions will range 
from about 1-20 residues. Preferably, amino acid substitutions will comprise 
5 conservative amino acid substitutions, such as those described supra. 
Table 1: Properties of naturally occurring amino acids. 



Charge properties / 


Side group 


Amino Acid 


hydrophobicity 






nonpolar 


Aliphatic 


Ala, Sle, Leu, 


hydrophobic 


Aliphatic, S-containing 


Val 




Aromatic 


Met 




Imino 


Phe, Trp 
Pro 


polar uncharged 


Aliphatic 


Gly 




Amide 


Asn, Gin 




Aromatic 


Tyr 




Hydroxyl 


Ser. Thr 




Suifhydryl 


Cys 


positively charged 


Basic 


Arg, His, Lys 


negatively charged 


Acidic 


Asp, Gly 



10 insertional amino acid sequence variants of a protein of the invention are those in which 
one or more amino acid residues are introduced into a predetermined site in said protein, 
insertions can comprise amino-terminal and/or carboxy-terminal fusions as well as intra- 
sequence insertions of single or multiple amino acids. Generally, insertions within the 
amino acid sequence will be smaller than amino or carboxyl terminal fusions, of the 

15 order of about 1 to 10 residues. Examples of amino- or carboxy-terminal fusion proteins 
or peptides Include the binding domain or activation domain of a transcriptional activator 
as used in the yeast two-hybrid system, phage coat proteins, (histidine)s-tag, glutathione 
S-transf erase, protein A, maltose-binding protein, dihydrofolate reductase, Tag«100 
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epitope (EETARFQPGYRS), c-myc epitope (EQKLISEEDL), FLAG®-epitope 
(DYKDDDK), lacZ, CMP (calmodulin-binding peptide), HA epitope (YPYDVPDYA), 
protein C epitope (EDQVDPRLIDGK) and VSV epitope (YTDIEMNRLGK). 
DeietKjnal variants of a protein of the invention are characterised by the removal of one 

5 Of more amino acids from the amino acid sequence of said protein. 

HorrK>(ogues of the peptides or polypeptides of the Invention contain a number of amino 
aod substitutions, deletions and/or additions relative to said peptide or polypepthide 
such that said homotogues are still substantially different from the peptides or 
poiypeptides known in the art. The difference between proteins can be calculated in 

10 lefms of % identicity according to methods wed known in the asl. Preferably, the 
homoksgues of the peptides or polypeptides of the present Invention comprise the amino 
acxi sequence represented in SEQ ID NO 6, optionaily comprising minor amino acid 
substitutsons such as those described in table 1. Other homologues of the polypeptides 
o« the invention have an amino acid sequence which is at least 65 % identical to the 

1 .1 soqu«oce represented in SEQ ID NO 4 or 19. 

Amino acid variants of a protein of the invention may readily be made using peptide 
synthetic techniques well known In the art, such as solid phase peptide synthesis and 
itie bke, or by recombinant DNA manipulations. The manipulation of DNA sequences to 
produce variant proteins which manifest as substitutional, insertional or deletional 

20 vanants are well known In the art. For example, techniques for making substitution 
mutations at predetermined sites In DNA having known sequence are well known to 
those skilled in the art, such as by M13 mutagenesis, T7-Gen in vitro mutagenesis kit 
(USB, Cleveland, OH), QuickChange Site Directed mutagenesis kit (Stratagene, San 
D»ego. CA), PCR-mediated site-directed mutagenesis or other site-directed mutagenesis 

2S protocols. 

"Oeffvatives" of a protein of the invention are those peptides, oligopeptides, 
po»yp«Pt>des, proteins and enzymes which comprise at least about five contiguous 
amtno acid residues of said polypeptide but which retain the biological activity of said 
protesn and which may further comprise additional naturally-occurring, altered 
30 glycosylated, acyiated or non-naturaily occurring amino acid residues compared to the 
amino acid sequence of a naturally-occurring form of said polypeptide. Alternatively or in 
addition, a derivative may comprise one or more non-amino acid substituents compared 
to the amino acid sequence of a naturally-occurring form of said polypeptide, for 
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example a reporter molecule or other ligand, covalently or non-covalently bound to the 
amino acid sequence such as, for example, a reporter molecule which is bound thereto 
to facilitate its detection. Preferably, the derivates of the proteins of the present invention 
comprise the amino acid acid sequence as represented in SEQ ID NO 5. 
5 With "immunologically active" is meant that a molecule or specific fragments thereof 
such as epitopes or haptens are recognized by, I.e. bind to antibodies. Preferably, the 
immunoiogically active or fragments of the proteins of the present invention comprise the 
amino acid acid sequence as represented in SEQ ID NO 5, 

With "tunctionai tragmenf or "functional homologue" is meant a protein which comprises 

10 at least about 5,10, 20, 50, 75, 100, 200, 300. 400, 500, 600, 700, 800, 900, 1000. 1050, 
1100, 1110, 1120, 1130, 1131, 1132, 1133 or 1134 contiguous amino acid residues of 
said polypeptide but which retains the biological activity of said protein. Preferably, the 
functional fragments or functional homoiogues of the polypeptide or proteins of the 
invention comprise the amino acid sequence as represented in SEQ ID NO 5 but 

15 wherein minor amino acid substitutions are allowed (see table 1 ). 

In the context of the current invention are embodied homologs, derivatives and/or 
immunologically active fragments of any of the inventive LRR receptor-like kinase protein 
or polypeptide or homolog, derivative or fragment thereof as defined supra. 
"Antibodies" include monoclonal, polyclonal, synthetic or heavy chain camel antibodies 

20 as well as fragments of antibodies such as Fab, Fv or scFv fragments. Monoclonal 
antibodies can be prepared by the techniques as described in e.g. Liddle and Cryer 
(1 991 ) which comprise the fusion of mouse myeloma cells to spleen cells derived from 
immunized animals. Furthermore, antibodies or fragments thereof to a molecule or 
fragments thereof can be obtained by using methods as described in e.g. Harlow and 

25 Lane (1 988). In the case of antibodies directed against small peptides such as fragments 
of a protein of the invention, said peptides are generally coupled to a carrier protein 
before immunization of animals. Such protein carriers include keyhole limpet 
hemocyanin (KLH), bovine serum albumin (BSA), ovalbumin and Tetanus toxoid. The 
carrier protein enhances the immune response of the animal and provides epitopes for 

30 T-cell receptor binding sites. The term "antibodies" furthermore includes derivatives 
thereof such as labelled antibodies. Antibody labels include alkaline phosphatase, 
PKH2, PKH26, PKH67, fluorescein (FITC), Hoecst 33258, R-phycoerythrin (PE), 
rhodamine (TRITC), Quantum Red, Texas Red, Cy3, biotin, agarose, peroxidase, gold 
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Spheres and radiolabels (e.g. '^^i, ^^1, ^^S, ■'*C, ^-P, ^H). Tools ia molecular biology 
relying on antibodies against a protein include protein gel biot analysis, screening of 
expression libraries allowing gene identification, protein quantitative metfiods including 
ELISA and RIA, immunoaffinity purification of proteins, immunoprecipitation of proteins 

5 (e.g. Magyar et al. 1997) and immunolocalization. Other uses of antibodies and 
especially of peptide antibodies include the study of proteolytic processing (Loffler et al. 
1994, Woulfe et al. 1994), determination of protein active sites (Lerner 1982), the study 
of precursor and post-translational processing (Baron and Baltimore 1 982, Lerner et al. 
1 981 , Semler et al. 1 982), identification of protein domains involved in protein-protein 

10 interactions (Murakami et al. 1992) and the study of exon usage in gene expression 
(Tamura et al. 1991). 

Embodied in the current invention are antibodies recognizing a LRR receptor-like kinase 
polypeptide or protein, or homolog, derivative or fragment thereof as defined supra or 
specific epitopes of said polypeptide or protein. 

15 The terms "gene(s)", "polynuGieotide(s)", "nucleic acid sequenc;e(s)", "riucieotlde 
sequence(s)", "DNA sequence(s)" or "nucleic acid rnoiecule(s)", v^hen used herein refer 
to nucieotides, either ribonucleotides or deoxyribonucleotides or a combination of both, 
in a polymeric form of any iength. Said terms furthermore include double-stranded and 
single-stranded DNA and RNA. Said terms also include l<nown nucleotide modifications 

20 such as methylation, cyclization and 'caps' and substitution of one or more of the 
naturally occurring nucleotides with an analog such as inosine. Modifications of 
nucleotides include the addition of acridine, amine, biotin, cascade blue, cholesterol, 
Cy3®, Cy5®, Cy5.5® Dabcyl, digoxigenin, dinitrophenyl, Edans, 6-FAM, fluorescein, 3'- 
glyceryl, HEX, IRD-700, IRD-800, JOE, phosphate psoralen, rhodamine, ROX, thiol 

25 (SH), spacers, TAMRA, TET, AMCA-S®, SE, BODIPY®, Marina Blue®, Pacific Blue*, 
Oregon Green®, Rhodamine Green®, Rhodamine Red®, Rhodol Green® and Texas 
Red®. Polynucleotide backbone modifications include methylphosphonate, 2'-0Me- 
methylphosphonate RNA, phosphorothiorate, RNA, 2'-0MeRNA. Base modifications 
include 2-amino-dA, 2-aminopurine, 3'-(ddA), 3'dA(cordycepin), 7-deaza-dA, 8-Br-dA, 8- 

30 oxo-dA, N®-Me-dA, abasic site (dSpacer), biotin dT, 2'-0Me-5Me-C, 2'-OMe-propynyUC, 
3'-(5-Me-dC), 3'-(ddC), 5-Br-dC, 5-l-dC, 5-Me-dC, 5-F-dC, carboxy-dT, convertible dA, 
convertible dC, convertible dG, convertible dT, convertible dU, 7-deaza-dG, 8-Br-dG, 8- 
oxo-dG, 0^-Me-dG, S6-DNP-dG, 4-methyl-indole, 5-nitroindole, 2'-OMe-inosine, 2'-dl, 0^- 
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phenyl-dl. 4-methyi-indole, 2'-deoxynebu!anne, 5-nitroindole, 2-aminopurine, dP(purine 
analogue), dK(pyrimidine analogue), 3-nitropyrroie, 2-thio-dT, 4-thio-dT, biotin-dT, 
cartxixy-dT, 0*-Me-dT, 0*-triazoi dT, 2'-OMe-propynyl-U, 5-Br-dU, 2'-dU, 5-F-dU, 5-l-dU, 
O'-tnazol dU and radiolabels (e.g. ^^^1, ^S, ^*C, ^^P. ^H). Said terms also 

5 encompass peptide nucleic acids (PNAs), a DNA analogue in whicii the backbone is a 
pseuciopeptide consisting of N-(2-aminoethyl)-glycine units rather than a sugar. PNAs 
rrmvc the behaviour of DNA and bind complementary nucleic acid strands. The neutral 
bacKixjne of PNA results in stronger binding and greater specificity than normally 
achieved. In addition, the unique chemical, physical and biological properties of PNA 

10 have been exploited to produce powerful biomoiecular tools, antisense and antigens 
agents, moiecular probes and biosensors. With "recombinant DNA molecule" or 
"chsmorsc gene" is meant a hybrid DNA produced by joining pieces of DNA from different 
soufcos.Wfth "heterologous nucleotide sequence" is intended a sequence that is not 
natufalty occurring with the promoter sequence. White this nucleotide sequence is 

1^ ix;tofologous to the promoter sequence, it may be homoiogous, or native, or 
hetofologous, or foreign, to the plant host. "Sense strand" refers to the strand of a 
ckxjbte-stranded DNA molecule that is homologous to a mRNA transcript thereof. The 
*anu-sense strand" contains an inverted sequence which is complementary to that of the 
*sonse strand". 

2t) A •coding sequence" or "open reading frame" or "ORF" Is defined as a nucleotide 
sequence that can be transcribed into mRNA and/or translated into a polypeptide when 
ptacod under the control of appropriate regulatory sequences, i.e. when said coding 
sequence or ORF is present in an expressible format. Said coding sequence of ORF is 
tx>undod by a 5' translation start codon and a 3' translation stop codon. A coding 

25 sequence or ORF can Include, but is not limited to RNA, mRNA, cDNA, recombinant 
nucleotide sequences, synthetically manufactured nucleotide sequences or genomic 
DNA Said coding sequence or ORF can be interrupted by intervening nucleic acid 
sequences. 
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Table 2. Degeneracy of the genetic code. 



Amino Acid 


Three- 
letter 
code 


One- 
letter 
code 


Possible codons 


Alanine 


Ala 


A 


GCA 


GCC 


GGG 


GGU 






Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


GGG 


CGU 


Asparagine 


Asn 


H 


AAC 


AAU 










Aspartic Acid 


Asp 


D 


GAC 


GAU 










Cysteine 


Cys 


C 


U6C 


UGU 










Glutamic Acid 


Glu 


E 


GAA 


GAG 










Giutamine 


Gin 


Q 


CAA 


GAG 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






HistJdine 


His 


H 


CAC 


CAU 










Isoleucine 


He 


1 


AUA 


AUG 


AUU 








Leucine 


Lgu ~^ 


L 


UUA 


UUG 


CUA 


cue 


GUG 


CUU 


Lysine 


Lys 


K 


AAA 


AAG 










Methionine 


Met 


M 


AUG 












Phenyiaianine 






UUG 


UUU 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Serine 


Ser 


S 


AGO 


AGU 


UGA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACQ 


ACG 


ACU 






Tryptophan 


Trp 


W 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 










Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 





Possible "STOP" codons 



UAA I UAG I UGA | | | 

Genes and coding sequences essentially encoding Ihe same protein but isolated from 
5 different sources can consist of substantially divergent nucleic acid sequences. 
Reciprocally, substantially divergent nucleic acid sequences can be designed to effect 
expression of essentially the same protein. Said nucleic acid sequences are the result of 
e.g. the existence of different alleles of a given gene, of the degeneracy of the genetic 
code or of differences in codon usage. Thus, as indicated in Table 2, amino acids such 
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as methionine and tryptophan are encoded by a singie codon whereas other amino 
acids such as arginine, leucine and serine can each be translated from up to six different 
codons. Differences In preferred codon usage are illustrated below for Agrobacterium 
tumefaciens (a bacterium), A. thaliana, M. sativa (two dicotyledonous plants) and Oryza 

5 sativa (a monocotyledonous plant). These examples were extracted from 
{http.//www.kazusa.or.jp/codon). To give one example, the codon GGC (for glycine) is 
the most frequently used codon in A. tumefaciens (36.2 %o), is the second most 
frequently used codon in O. sativa but is used at much lower frequencies in A. thaliana 
and M. sativa (9 %o and 8.4 %o, respectively). Of the four possible codons encoding 

10 glycine (see Table 6), said GGC codon is most preferably used in A. tumefaciens and O. 
sativa. However, in A. thaliana this is the GGA (and GGU) codon whereas in M. sativa 
this is the GGU (and GGA) codon. 

"Hybridization" is the process wherein substantially homoiogous complementary 
nucleotide sequences anneal to each other. The hybridization process can occur entirely 

15 in solution, i.e. both complementary nucleic acids are in solution. Tools in molecular 
biology reiying on such a process include the polymerase chain reaction (PGR; and ail 
methods based thereon), subtractive hybridization, random primer extension, nuclease 
S1 mapping, primer extension, reverse transcription, cDNA synthesis, differentia! display 
of RNAs, and DMA sequence determination. The hybridization process can also occur 

20 with one of the complementary nucleic acids Immobilized to a matrix such as magnetic 
beads, Sepharose beads or any other resin. Tools in molecular biology relying on such a 
process include the isolation of poly (A+) mRNA. The hybridization process can 
furthermore occur with one of the complementary nucleic acids immobilized to a solid 
support such as a nitrocellulose or nylon membrane or immobilized by e.g. 

25 photolitography to e.g. a silicious glass support (the latter known as nucleic acid arrays 
or microarrays or as nucleic acid chips). Tools in molecular biology relying on such a 
process include RNA and DNA gel blot analysis, colony hybridization, plaque 
hybridization, in situ hybridization and microarray hybridization. In order to allow 
hybridization to occur, the nucleic acid m.olecules are generally thermaliy or chemically 

30 denatured to melt a double strand into two single strands and/or to remove hairpins or 
other secondary structures from singie stranded nucleic acids. The stringency of 
hybridization is influenced by conditions such as temperature, salt concentration and 
hybridization buffer composition. High stringency conditions for hybridization include high 
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temperature and/or low salt concentration (saits include MaCl and Nas-citrate) and/or the 
inciusion of formamide in the hybridization buffer and/or lowering the concentration of 
compounds such as SDS (detergent) in the hybridization buffer and/or exclusion of 
compounds such as dextran sulfate or polyethylene glycol (promoting molecular 

5 crowding) from the hybridization buffer. Conventional hybridization conditions are 
described in e.g. Sambrook et al. (1989) but the skilled craftsman will appreciate that 
numerous different hybridization conditions can be designed in function of the known or 
the expected homology and/or length of the nucleic acid sequence. Sufficiently low 
stringency hybridization conditions are particularly prefen-ed to isolate nucleic acids 

10 heterologous to the DMA sequences of the invention defined supra. Elements 
contributing to said heterology include alielism, degeneration of the genetic code and 
differences in preferred codon usage as discussed supra. 

Clearly, the current invention embodies the use of the inventive DNA sequences 
encoding a LRR receptor-like kinase, homoiog, derivative and/or immunofoglcaily 
15 fragment thereof as defined higher in any method of hybridization . The current invention 
furthermore also relates to DNA sequences specifically hybridizing to said inventive DNA 
sequences. 

DNA sequences as defined in the current invention can be interrupted by intervening 
sequences. With "intervening sequences" is meant any nucleic acid sequence which 

20 disrupts a coding sequence comprising said inventive DNA sequence or which disrupts 
the expressible format of a DNA sequence comprising said Inventive DNA sequence. 
Removal of the intervening sequence restores said coding sequence or said expressible 
format. Examples of Intervening sequences include introns, mobilizable DNA sequences 
such as transposons and DNA tags such as e.g. a T-DNA. With "mobilizable DNA 

25 sequence" is meant any DNA sequence that can be mobilized as the result of a 
recombination event. 

To effect expression of a protein in a ceil, tissue or organ, preferably of plant origin, 
either the protein may be introduced directly to said cell, such as by microinjection or 
ballistic means or alternatively, an isolated nucleic acid molecule encoding said protein 
30 may be introduced into said cell, tissue or organ in an expressible format. 

Preferably, the DNA sequence of the invention comprises a coding sequence or open 
reading frame (ORF) encoding a LRR receptor-iike kinase or a homoiog or derivative 
thereof or an immunologically active fragment thereof as defined supra. The preferred 
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protein of the invention comprises tiie amino acid sequence of said LRR receptor-iil^e 
kinase. 

Witii "vector" or "vector sequence" is meant a DNA sequence wliich can be introduced in 
an organism by transformation and can be stably maintained in said organism. Vector 
maintenance is possible in e.g. cultures of Escherichia coli, A. tumefaciens, 
Saccharomyces cerevisiae or Schizosaccharomyces pombe. Other vectors such as 
phagemids and cosmid vectors can be maintained and multiplied in bacteria and/or 
viruses. Vector sequences generally comprise a set of unique sites recognized by 
restriction enzymes, the multiple cloning site (MCS), wherein one or more non-vector 
sequence(s) can be inserted. 

With "non-vector sequence" is accordingly meant a DNA sequence which fs integrated in 
one or more of the sites of the MCS comprised within a vector. 

"Expression vectors" form a subset of vectors which, by virtue of comprising the 
appropriate regulatory sequences enabling the creation of an expressible format for the 
inserted non-vector sequence(s), thus allowing expression of the protein encoded by 
said non-vector sequence(s). Expression vectors are known in the art enabling protein 
expression in organisms including bacteria (e.g. E. coli), fungi {e.g. S. cerevisiae, S. 
pombe, Pichia pastoris), insect cells (e.g. baculoviral expression vectors), animal cells 
(e.g. COS or CHO cells) and plant cells (e.g. potato virus X-based expression vectors, 
see e.g. Vance at al. 1 998 - WO9844097). 

The current invention clearly includes any vector or expression vector comprising a non- 
vector DNA sequence comprising the promoter sequence according to the present 
invention or a non)vector sequence encoding a LRR receptor-like kinase, homolog, 
derivative and/or immunologically active fragment thereof as defined supra. 
As an alternative to expression vector-mediated protein production in biological systems, 
chemical protein synthesis can be applied. Synthetic peptides can be manufactured in 
solution phase or in solid phase. Solid phase peptide synthesis (Merrifield 1963) is, 
however, the most common way and involves the sequential addition of amino acids to 
create a linear peptide chain. Solid phase peptide synthesis includes cycles consisting of 
three steps: (i) immobiiization of the carboxy-terminai amino acid of the growing peptide 
chain to a solid support or resin; (ii) chain assembly, a process consisting of activation, 
coupling and deprotection of the amino acid to be added to the growing peptide chain; 
and (iii) cleavage involving removal of the completed peptide chain from the resin and 
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removal of the protecting groups from the amino acid side chains. Common approaches 
in solid phase peptide synthesis include Fmoc/tBu (9-fluorenylmethyloxycarbonyi/t-butyi) 
and Boc (t-butyloxycarbonyl) as the amino-terminal protecting groups of amino acids. 
Amino acid side chain protecting groups include methyl (Me), fornr\yl (CHO), ethyl (Et), 

5 acetyl (Ac), t-butyl (t-Bu), anisyl, benzyl (Bzl), trifluroacetyl (Tfa), N-hydroxysuccinimide 
(ONSu, OSu), benzoyl (Bz), 4-methylbenzyl (Meb), thioanizyl, thiocresyl, 
benzyioxymethyi {Bom), 4-nitrophenyl (ONp), benzyloxycarbonyl (Z), 2-nitrobenzoyl 
(NBz), 2-nitrophenylsu!phenyi (Nps), 4-toiuenesulphonyl (Tosyl.Tos), pentafluorophenyl 
(Pfp), diphenylmethyl (Dpm), 2-chlorobenzyloxycarbonyl (Cl-Z), 2,4,5-trichlorophenyl, 2- 

10 bromobenzyloxycarbonyi (Br-2), tripheylmethyl (Trityl, Trt), and 2,5,7,8-pentamethyl- 
chroman-6-su!phony! (Pmc). During chain assembly, Fmoc or Boc are removed resulting 
in an activated amino-terminus of the amino acid residue bound to the growing chain. 
The carboxy-terminus of the incoming amino acid is activated by conversion into a highfy 
reactive ester, e.g. by HBTU. With current technoiogies (e.g. PerSeptive E^sosystems 

15 9050 synthesizer, Applied Biosystems Model 431 A Peptide Synthesizer), linear peptides 
of up to 50 residues can be manufactured. A number of guidelines is available to 
produce peptides that are suitable for use in biological systems including (i) limiting the 
use of difficult amino acids such as cys, met, trp (easily oxidized and/or degraded during 
peptide synthesis) or arg; (ii) minimize hydrophobic amino acids (can impair peptide 

20 solubility); and (iii) prevent an amino-terminal glutamic acid (can cyclize to 
pyroglutamate). 

By "expressible format" is meant that the isolated nucleic acid molecule is in a form 
suitable for being transcribed into mRNA and/or translated to produce a protein, either 
constitutively or following induction by an intracellular or extracellular signal, such as an 

25 environmental stimulus or stress (mitogens, anoxia, hypoxia, temperature, salt, light, 
dehydration, etc) or a chemical compound such as iPTG (isopropyl-P-D- 
thiogalactopyranoside) or such as an antibiotic (tetracycline, ampicillin, rifampicin, 
kanamycin), hormone (e.g. gibberellln, auxin, cytokinln, giuGocorticoid, brasslnosteroid, 
ethylene, abscisic acid etc), hormone analogue (iodoacetic acid (lAA), 2,4-D, etc), metal 

30 (zinc, copper, iron, etc), or dexamethasone, amongst others. As will be known to those 
skilied in the art, expression of a functional protein may also require one or more post- 
translationai modifications, such as giycosyiation, phosphorylation, dephosphorylation, 
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or one or more protein-protein interactions, amongst others. A!! such processes are 
included within the scope of the term "expressible format". 

Preferably, expression of a protein in a specific cell, tissue, or organ, preferably of plant 
origin, is effected by introducing and expressing an isolated nucleic acid molecule 
5 encoding said protein, such as a cDNA molecule, genomic gene, synthetic 
oligonucleotide molecule, mRNA molecule or open reading frame, to said cell, tissue or 
organ, wherein said nucleic acid molecule is placed operably in connection with suitable 
regulatory sequences including a promoter, preferably a plant-expressible promoter, and 
a terminator sequence. 

10 "Regulatory sequence" refers to control DNA sequences which are necessary to affect 
the expression of coding sequences to which they are iigated. The nature of such control 
sequences differs depending upon the host organism. !n prokaryotes, control sequences 
generally include promoters, ribosomal binding sites, and terminators. In eukaryotes 
generally control sequences include promoters, terminators and enhancers or silencers. 

15 The term "control sequence" is intended to include, at a minimum, all components the 
presence of which are necessary for expression, and may also include additional 
advantageous components and which determines when, how much and where a specific 
gene is expressed. 

Reference herein to a "promoter" Is to be taken in its broadest context axvi includes the 
20 transcriptional regulatory sequences derived from a classical eukaryotic genomic gene, 
including the TATA box which is required for accurate transcription initiation, with or 
without a CCAAT box sequence and additional regulatory elements- (i.e. upstream 
activating sequences, enhancers and silencers) which alter gene expression in response 
to developmental and/or external stimuli, or in a tissue-specific manner. 
25 The term "promoter" also includes the transcriptional regulatory sequences of a classical 
prokaryotic gene, in which case it may include a -35 box sequence and/or a -10 box 
transcriptional regulatory sequences. 

The term "promoter" is also used to describe a synthetic or fusion molecule, or derivative 
which confers, activates or enhances expression of a nucleic acid molecule in a cell, 

30 tissue or organ. 

The term "functional part" when referred to nucleic acids in relation to the above defined 
transcriptional regulatory sequences, refers to a part or parts of the nucleic acid having 
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the activity to specificaliy drive or promote transcription from sequences which are 
located downstream of said nucieic acid sequence. 

Promoters may contain additionai copies of one or more specific regulatory elements, to 
further enhance expression and/or to alter the spatiai expression and/or temporal 

5 expression of a nucleic acid molecule to which it is operably connected. Such regulatory 
elements may be placed adjacent to a heterologous promoter sequence to drive 
expression of a nucleic acid molecule in response to e.g. copper, glucocorticoids, 
dexamethasone, tetracycline, gibberellin, cAMP, abscisic acid, auxin, wounding, 
ethylene, jasmonate or salicylic acid or to confer expression of a nucleic acid molecule to 

10 specific cells, tissues or organs such as meristems, leaves, roots, embryo, flowers, 
seeds or fruits. 

In the context of the present invention, the promoter preferably Is a plant-expressible 
promoter sequence. Promoters, however, that also function or solely function in non- 
plant cells such as bacteria, yeast ceils, insect celis and animal cells are not excluded 

15 from the invention. By "pfant-expressible" is meant that the promoter sequence, including 
any additional rGgulatory elements added thereto or coritained therein, is at [east 
capable of inducing, conferring, activating or enhancing expression in a plant cell, tissue 
or organ, preferably a monocotyledonous or dicotyledonous plant ceil, tissue, or organ. 
The terms "plant-operable" and "operable in a plant" when used herein, in respect of a 

20 promoter sequence, shall be taken to be equivalent to a plant-expressible promoter 
sequence. 

in the present context, a "regulatable promoter sequence" is a promoter that is capable 
of conferring expression on a structural gene in a particular cell, tissue, or organ or group 
of cells, tissues or organs of a plant, optionally under specific conditions, however does 

25 generally not confer expression throughout the plant under all conditions. Accordingly, a 
regulatable promoter sequence may be a promoter sequence that confers expression on 
a gene to which it is operably connected in a particular location within the plant or 
alternatively, throughout the plant under a specific set of conditions, such as following 
induction of gene expression by a chemical compound or other ellcitor. 

30 Preferably, the regulatable promoter used in the performance of the present invention 
confers expression in a specific location within the plant, either constitutively or following 
induction, however not in the whole plant under any circumstances. Included within the 
scope of such promoters are cell-specific promoter sequences, tissue-specific promoter 
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sequences, organ-specific promoter sequences, eel! cycle specific gene promoter 
sequences, inducible promoter sequences and constitutive promoter sequences that 
liave been modified to confer expression in' a particular part of tine plant at any one time, 
such as by integration of said constitutive promoter within a transposable genetic 
element {Ac, Ds, Spm, En, or other transposon). The skilled craftsman will understand 
that a "constitutive promoter" is a promoter that is transcriptionaly active in an organism, 
preferably a plant, during most, but not necessarily all phases of its growth and 
development. Similarly, the skilled craftsman will understand that a "ubiquitous promoter" 
is a promoter that is transcriptionally active throughout most, but not necessarily all parts 
of an organism, preferably a plant. 

Generally by "weak promoter" is intended a promoter that drives expression of a coding 

sequence at a low level. By "low level" is intended at levels of about 1/10,000 transcripts 
to about 1/100,000 transcripts, to about 1/500,0000 transcripts. Conversely, a "strong 
promoter" drives expression of a coding sequence at high ievei, or at about 1/10 
transcripts to about 1/100 transcripts to about 1/1 ,000 transcripts. 
The term "cell -specific" shali be taken to Indicate that expression is predominantly in a 
particular cell or celi-type, preferably of plant origin, albeit not necessarily exclusively in 
said cell or cell-type. 

Similarly, the term "tissue-specific" shall be taken to indicate that expression is 
predominantly in a particular tissue or tissue-type, preferably of plant origin, albeit not 
necessarily exclusively in said tissue or tissue-type. 

Similarly, the term "organ-specific" shall be taken to indicate that expression is 
predominantly in a particular organ, preferably of plant origin, albeit not necessarily 
exclusively in said organ. "Root specific" means that the promoter is expressed in the 
root only and not in other tissues of the plant. 

By "root-preferred" it is intended that expression of the heterologous nucleotide 
sequence is most abundant root, but could also have low expression levels elsewhere in 
the plant. While some level of expression of the heterologous nucleotide sequence 
occurs in other plant tissue types, expression occurs most abundantly In the root 
including primary, lateral and adventitious roots. 

By "roof is intended any part of the root structure, including, but not limited to, the root 
cap, apical meristem, protoderm, ground meristem, procambium, endodermis, cortex, 
vascular cortex, epidermis, and the like. 
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Sims!ar!y, the term "cell cycle specific" shall be taken to indicate that expression is 
predominantly cycilc and occurring in one or more, not necessarily consecutive phases 
of the cell cycle albeit not necessarily exclusively in cycling cells, preferably of plant 
origin. 

5 Contrarily, the term ubiquitous shall be taken to indicate that expression is throughout 
the entire oranism. 

Those skilled in the art will be aware that an "inducible promoter" is a promoter the 
transcriptional activity of which is increased or induced In response to a developmental, 
chemical, environmental, or physical stimulus. Similarly, the skilled craftsman will 

10 understand that a "constitutive promoter' is a promoter that is transcriptionally active 
throughout most, but not necessarily all parts of an organism, preferably a piant, during 
most, bsjt not necessarily all phases of its growth and development. 
Those skilled in the art will readily be capable of selecting appropriate promoter 
sequences for use in regulating appropriate expression of the LRR receptor-like kinase 

15 as described supra from publicly-availabie or readily-avaslabie sources, without undue 
experimentation. 

Placing a nucleic acid molecule under the regulatory controi of a promoter sequence, or 
in operable connection with a promoter sequence, means positioning said nucleic acid 
molecule such that expression is controlled by the promoter sequence. A promoter is 

20 usually, but not necessarily, positioned upstream, or at the 5'-end, and within 2 kb of the 
start site of transcription, of the nucleic acid molecule which it regulates. In the 
construction of heterologous promoter/structural gene combinations it is generally 
preferred to position the promoter at a distance from the gene transcription start site that 
is approximately the same as the distance between that promoter and the gene it 

25 controls in its natural setting (i.e., the gene from which the promoter is derived). As is 
known in the art, some variation in this distance can be accommodated without loss of 
promoter function. Similarly, the preferred positioning of a regulatory sequence element 
with respect to a heterologous gene to be placed under its control is defined by the 
positioning of the element In its natural setting (i.e., the gene from which it is derived). 

30 Again, as is known in the art, some variation in this distance can also occur. 

"Expression" means the production of a protein or nucleotide sequence in the cell itself 
or in a cell-free system. It includes transcription into an RNA product, post-transcriptional 
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modification and/or translation to a protein product or polypeptide fronn a DNA encoding 
that product, as well as possible post-translational modifications. 
"Operably linked" refers to a juxtaposition wherein the components so described are in a 
relationship pemnitting them to function in their intended manner. A control sequence 
5 "operably linked" to a coding sequence is ligated in such a way that expression of the 
coding sequence is achieved under conditions compatible with the control sequences. In 
case the control sequence is a promoter, it Is obvious for a skilled person that double- 
stranded nucleic acid is preferably used. 

Examples of promoters suitable for use in gene constructs of the present invention 

10 include those listed In Table 3, amongst others. The promoters listed in Table 5 are 
provided for the purposes of exemplification only and the present invention is not to be 
limited by the list provided therein. Those skilled in the art will readily be in a position to 
provide additional promoters that are useful in performing the present invention. 
In the case of constitutive promoters or promoters that induce expression throughout the 

15 entire plant, it is preferred that such sequences are modified by the addition of 
nucleotide sequences derived from one or more of the tissue-specific promoters listed in 
Table 8, or alternatively, nucleotide sequences derived from one or more of the above- 
mentioned tissue-specific inducible promoters, to confer tissue-specificity thereon. For 
example, the CaMV 35S promoter may be modified by the addition of maize Adhi 

20 promoter sequence, to confer anaerobically-regulated root-specific expression thereon, 
as described previously (Ellis ef a/,, 1987). Another example describes conferring root 
specific or root abundant gene expression by fusing the CaMV35S promoter to elements 
of the maize glycine-rich protein GRP3 gene (Feix and Wulff 2000 - WO0015662). Such 
modifications can be achieved by routine experimentation by those skilled in the art. 

25 The term "terminator" refers to a DNA sequence at the end of a transcriptional unit which 
signals termination of transcription. Terminators are 3'-non-translated DNA sequences 
containing a polyadenylation signal, which facilitates the addition of polyadenylate 
sequences to the 3'-end of a primary transcript. Terminators active in cells derived from 
viruses, yeasts, moulds, bacteria, insects, birds, mammals and plants are known and 

30 described in the literature. They may be isolated from bacteria, fungi, viruses, animals 
and/or plants. 

Examples of term.lnators particularly suitable for use in the gene constructs of the 
present invention include the Agrobacterium tumefaciens nopaline synthase (NOS) gene 
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terminator, the Agrobacten'um tumefaciens octopine synthase (OCS) gene terminator 
sequence, the Cauliflower mosaic virus (CaMV) 35S gene terminator sequence, the 
Oryza sativa ADP~giucose pyrophosphorylase terminator sequence (t3'Bt2), the Zea 
mj>'s zein gene terminator sequence, the rbcs-IA gene terminator, and the rbcs-3A 
s gone terminator sequences, amongst others. 
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Table 3. Exemplary plant-expressible promoters for use 
in the performance of the present invention 



!: CELL-SPECIFIC,T!SSUE-SPECIRC, AND ORGAN-SPECIFiC PROMOTERS 


GENE SOURCE 


EXPRESSION 
PATTERN 


REFERENCE 


a-amyiase {Amy32b) 


Aleurone 


Lanahan et al, Plant Cell 4:203-21 1, 1992; 
Skriver et al, Proc Natl Acad Scl USA 
88:7266-7270,1991 


cathepsin p-iike 
gene 


Aleurone 


Cejudo et al. Plant Mo! Biol 20:849-856. 
1992 


Agrobacterium 
rhizogenes rolB 


Cambium 


Nilsson et al, Physiol Plant 100:456-462, 
1997 


AtPRP4 


Flowers 


httD://salus.medium.edu/mma/tiernev/html 


chalcone synthase 
(chsA) 


Flowers 


Van der Meer et ai. Plant Mo! Biol 1 5:95- 
109.1990 


LAT52 


Anther 


Twell et al, Mol Gen Genet 217:240-245, 
1989 


apetala-3 


Flowers 




chilinase 


fruit (berries, grapes, 
etc) 


Thomas et al. CSIRO Plant Industry, 
Urrbrae, South Australia, Australia; 
htlD://winetitles.com.au/awrdc/csh95-1.html 


rbcs-3A 


green tissue (eg leaf) 


Lam et al. Plant Cell 2:857-866, 1990; 

Tucker et al., Plant Physiol 11 3:1 303-1 308, 

1392 


leaf-specific genes 


Leaf 


BaszGzynski et a\, hJucI Acid Res 16:4732, 
1988 


AtPRP4 


Leaf 


httD://salus, medium . edu/mmo/tiernev/htm 1 


chiorella virus 

adenine 

methyltransferase 
gene promoter 


Leaf 


Mitra and Higglns, Plant Mol Biol 26:85-93, 
1994 


aldP gene promoter 
from rice 


Leaf 


Kagaya et ai, Mo! Gen Genet 248:668-674, 
1995 
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rbcs promoter from 
rice or tomato 


Leaf 


Kyozuka et al, Plant Physio! 102:991-1000. 

1993 


Pinus cab-6 


Leaf 


Yamamoto et al. Plant Cell Physiol 35:773- 
778, 1934 


rubisco promoter 


Leaf 




cab (chiorophyll 
a/b/blnding protein 


Leaf 




pea B!ec4 gene 


Vegetative and floral 
epidermal tissues 


Mandaci and Dobres, Plant Moi Biol 
34:961-965 


SAM22 


senescent leaf 


Croweil et al. Plant Moi Biol 18:459-466, 
1992 


Itp gene (lipid 
transfer gene) 




Fieming et al, Riant J 2:855-862, 1992 


R. japonicum nif 
ger>e 


noduie 


United States Patent No 4 803165 


B. japonicum nifH 
gene 


nodule 


United States Patent No 5008194 


GmENOD40 


noduie 


Yang et ai, Plant J 3:573-585. 1993 


PEP carboxylase 
(PEPC) 


nodule 


Pathirana et al, Plant Moi Biol 20:437-450, 

1992 


Leghaemogiobin 
(Lb) 


nodule 


Gordon et al, J Exp Bot 44:1453-1465, 

1993 


Tungro bacilUform 
virus gene 


phloem 


Bhattacharyya- Pakrasi et al. Plant J 4:71- 
79, 1992 


pollen-specific genes 


pollen; microspore 


Albani et al, Plant Moi Biol 15:605, 1990; 
Albani et al, Plant Moi Biol 16:501 , 1 991 


Zm13 


pollen 


Guerrero et al, Moi Gen Genet 224:161- 
168, 1993 


apg gene 


microspore 


Twell et al. Sex Plant Reprod 6:217-224, 
1993 


maize pollen-specific 
gene 


pollen 


Hamilton et al. Plant Moi Biol 18:211-218, 
1992 


Sunflower pollen- 
expressed gene 


pollen 


Baltz et al. Plant J 2:713-721, 1992 


B. napus pollen- 


poilen;anther; tapetum 


Arnoldo et ai, J Cell Biochem, Abstract No. 
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specific gene 




Y101,204, 1992 


root-expressible 
genes 


roots 


Tingey et al, EMBO J 6:1 , 1987 


Tobacco auxin- 
inducible gene 


root tip 


Van der Zaai et a!, Plant Mol Bioi 16:983, 
1991 


p-tubu!in 


root 


Oppenheimer et al, Gene 63:87, 1988 


Tobacco root- 
specific genes 


root 


Conkling et al, Plant Physiol 93:1203, 1990 


B. napusGI-Sb 
gene 


root 


United States Patent No 5401836 


SbPRPI 


roots 


Suzuki et al, Plant Mol Biol 21:109-1 19, 

1993 


AtPRPI; AtPRP3 


roots; root hairs 


httD://sa!us.medium.edu/mr!^q/tiernev/htm! 


RD2 gene 


root cortex 


http 


//wvvw2 .ens u .ed u/ncsu/research 


TobRB7 gene 


root vasculature 




//www2.cnsu.8du/ncsu/res8arch 


AtPRP4 


leaves; flowers; lateral 
root primordia 




//salus.medium.edu/rnmq/tiernev/hlml 


seed-specific genes 


seed 


Simon etal, Plant Moi Bioi 5:191 , 1385; 
Scofield et al, J Biol Chem 262:12202, 
1987; Baszczynski et al, Plant Mol Biol 
14:633, 1990 


Brazii Nut albumin 


seed 


Pearson et al, Plant Moi Bioi 18:235-245, 
1992 


Legumin 


seed 


Ellis et al, Plant Mol Bioi 10:203-214, 1988 


Glutelin (rice) 


seed 


Takaiwa et al, fvlol Gen Genet 208:1 5-22, 
1986; Takaiwa et al, FEBS Lett 221 :43-47, 
1987 


Zein 


seed 


Matzke et al, Plant Mol Biol 14:323-32 1990 


Nap A 


seed 


Stalberg et al, Planta 199:515-519, 1996 


wheat LMW and 
HMW giutenin-1 


endosperm 


Moi Gen Genet 216:81-90, 1989; Nuci 
Acids Res 17:461-462, 1989 


wiieat SPA 


seed 


Aibani et al, Plant Ceil 9:171-184, 1997 


CZ19B1. maize 19 
kDa zein 


seed 


WOG011177 


mil ps, maize 


seed 


WO0011177 
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myoinositoi-1 -Pi 
synthase 






wheat a, p, y-gliadsns 


endosperm 


EMBO J 3:1409-1415, 1984 


barley ltr1 promoter 


endosperm 




barley B1 , C, D, 
hordein 


endosperm 


Theor App! Gen 98:1253-1262, 1999; Plant 
J 4:343-355, 1993; Moi Gen Genet 
250:750-60, 1996 


barley DOF 


endosperm 


fvlena et al, Riant J 116:53-62, 1998 


biz2 


endosperm 


EP991 0605S.7 


synthetic promoter 


endosperm 


Vlcente-Carbajosa et a!. Plant J 13:629- 
640, 1998 


rice proiamin NRPSS 


endosperm 


Wu et ai, Plant Celi Physio! 39: 885-889, 
1998 


rice a-giobulin Glb-1 


endosperm 


Wu et ai, Plant Celi Physiol 39:885-889, 

1998 


maize EMD genes 


endosperm 


WO0012733 


barley END1 


endosperm 


WO3808S61 


barley NUC1 


nucelius 


WO9808961 


rice 0SH1 


embryo 


Sato et ai, Proc Natl Acad Sci USA 
93:8117-8122,1996 


rice a-giobulin 
REB/OHP-1 


endosperm 


Nakase et al, Plant Moi Biol 33:513-522, 


rice ADP-glucose PP 


endosperm 


Trans Res 6:157-168, 1997 


maize ESR gene 
famiiy 


endosperm 


Plant J 1 2:235-246, 1 997 


sorgum "y-kafirin 


endosperm 


Plant Moi Biol 32:1029-1035, 1996 


KNOX 


embryo 


Postma-Haarsma et al. Plant Moi Biol 
39:257-271 , 1 999 


rice oleosin 


embryo and aieuron 


Wu et ai, J Biochem 123:386, 1998 


sunflower oieosiri 


seed (embryo and dry 


Cummins et al, Plant Mo! Biol 19:873-876, 
1992 


LEAFY 


siioot meristem 


Weigel et al. Cell 69:843-859, 1992 


Arabidopsis thailana 
knati 


shoot meristem 


Accession number AJ1 31 822 


Malus domestica 


r shoot meristem 


Accession number Z71 981 
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CLAVATA1 


shoot meristem 


Accession number AFQ49870 


stigma-specific 


stigma 


Nasrailah et al, Proc Natl Acad Sci USA 
85:5551 , 1988; Trick et al, Plant Mol Biol 
15:203, 1990 


class 1 patatin gene 


tuber 


Liu et al. Plant Mol Biol 153:386-395, 1991 


PCNA rice 


meristem 


Kosugi et al, NucI Acids Res 19:1571-1576, 

9:1607-1619, 1997 


Pg3 TuIdAI tUDUiin 


DtvidinQ colls 


Qt/^f7 anri J nnn Pliant f^Af^l Rinl Ai 'f^fM ^RiA 

1999 


ArBt}i(Jopsis cdcSs 




1995 


Ardbfcfops/sRQDlA 


-i- poilsn tub6s 


Li et als Plar^t Physiol 118'4Q7-417 1998 


Arabidopsis AtDMCI 


Meiosis-associated 


Kiimyuk and Jones, Plant J 11:1-14, 1997 


Pea PS-IAA4/5 and 
PS-IAA6 


Auxin-inducibje 


Wnnn ot al PIstni 1 Q-I=^a7-^QQ 100^% 
VVUny r^i al, "IctiU J «7.00/ 0\79, 10^70 


Pea 

farnesyltransferase 


MeristematiG tissues; 
phioem near growing 
tissues^ light- and 
sugar-repressed 


Zhou et al. Plant J 1 2:921 -930,1 997 


Tobacco (N. 
syivQstrls^ eye! in 
B1;1 


Dividing ceils / 


Trehin et al. Plant Mol.Biol. 35:667-672, 


Catharanthus roseus 
Mitotic cyclins CYS 
(A-type) and CYM 
(B-type) 


Dividing cells / 
m.eristematic tissue 


!to et al, Plant J 1 1 :983-992, 1 997 


Arabidopsis cycl At 
^=:cyc B1 j1) and 
cyc3aAt (A-type) 


Dividing ceils / 
meristematic tissue 


Shaul et al, Proc Natl Acad Sci USA 


AfBbidopsis tef 1 
promoter box 


DividinQ colls / 
meristematic tissue 


RofinH ot al Mol dan Cianai 0/lQ-7nO "yi I 

ntiyau CI al, iviui «en vaenei tLHO./uo-ii \, 
1995 


Catharanthus roseus 


Dividing cells / 


Ito St a!, Plant Mo! Biol 24:863-878, 1994 
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cyc07 1 meristematic tissue | 


II: EXEMPLARY CONSTITUTIVE PROMOTERS 


GENE SOURCE 


EXPRESSION 
rAI ICHN 


REFERENCE 


Actin 


constitutive 


McElroy et al, Plant Cell 2:163-171 , 1990 


CAMV 35S 


constitutive 


Odell et al. Nature 313:810-812, 1985 


CaMV19S 


constitutive 


Nilsson et al, Physiol Plant 100:456-462, 

1997 


GOS2 


constitutive 


de Pater et al, Plant J 2:837-844, 1992 


Ubiquitin 


constitutive 


Christensen et al, Plant Mol Biol 18:675- 

689, 1992 


rice cyclophilin 


constitutive 


Buchhoiz et al. Plant Mol Biol 25:837-843, 

1994 


maize hlstone H3 


constitutive 


Lepetit et al, Mol Gen Genet 231 :276-285, 

1992 


alfalfa histone H3 


constitutive 


Wu et ai, Nucleic Acids Res 17:3057-3063, 
1989; Wu et al. Riant Mol Bio! 1 1 :641-649, 
1988 


actin 2 


constitutive 


An etai, Plant J 10:107-121, 1996 


III: EXEMPLARY STRESS-INDUCIBLE PROMOTERS 


NAME 


STRESS 


REFERENCE 


P5CS (delta(1)- 

pyrroline-5- 
carboxylate syntase) 


salt, water 


Zhang et ai. Plant Sci 129:81-89, 1997 


corlSa 


coid 


Hajela et al, Plant Physiol 93:1246-1252, 
1990 


con 5b 


cold 


Wlihelm et al. Plant Mol Biol 23:1073-1077, 
1993 


cor1 5a (-305 to +78 
nt) 


cold, drought 


Baker et al, Plant Mol Biol 24: 01-713. 1994 


rci29 


salt, drought, cold 


Kasuga et al, Nature Biotechno! 18:287- 
291, 1999 


heat shoGl< proteins, 


heat 


Barros et al, Plant Moi Bioi 19 665-75. 
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including artificial 
promoters containing 
the heat shock 
element (HSE) 




1992. Marrs at a!, Dev Genetl 4:27-41 , 

1993. Schoffi et a!, Mol Gen Genet 
217:246-53, 1989. 


SmHSP (smali heat 
shock proteins) 


heat 


Waters et al, J Exp Bot 47:325-338, 1996 


WCS120 


cold 


Oueiiete et ai. FEES Lett 423:324-328, 

1998 


Gi7 


cold 


Kirch et a!, Plant Mol Biol 33:897-909, 

1997 


Adh 


cold, drought, hypoxia 


Doirerus etal, Plant Physiol 105:1075-87, 
1994 


pwsilS 


salt and drought 


Joshee et al, Plant Cell Physiol 39:64-72, 
1998 


ci21A 


cold 


Schneider et al. Plant Physiol 113:335-45, 

1997 


Trg-31 


drought 


Ghaudhary et al, Plant Mol Biol 30:1247-57, 
1996 


Osmotin 


osmotic 


Raghothama et al, Plant Mol Biol 23:1 117- 
28, 1993 


LapA 


wounding, 
enviromental 


WO99/03977 University of Californla/INRA 


IV: EXEMPLARY PATHOGEN-INDUCIBLE PROMOTERS 


NAME 


PATHOGEN 


REFERENCE 


RB7 


Root-knot nematodes 
(Meioidogyne spp.) 


US5760386 - North Carolina State 

University; Opperman et a!, Science 
263:221-23, 1994 


PR-1,2, 3, 4, 5, 8, 11 


fungal, viral, bacterial 


Ward et ai, Piant Cefi 3:1085-1094, 1991 ; 
Reiss et al 1996; Lebel et al, Plant J 
16:223-233, 1998; Melchers etal, Plant J 
5:469-480, 1994; La'Mon et al, Plant Mol 
Biol, 19:735-743, 1992 


HMG2 


nematodes 


WO9503690 - Virginia Tech Intellectual 
Properties inc . 


Abi3 


Cyst nematodes 


unpublished 
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(rieterodera spp.) 




ARM1 


nematodes 


Bartheis et al, Plant Cell 9:211 9-2134, 1997 
WO 68/31822 - Piant Genetic Systems 


Att0728 


nematodes 


Bartheis et a!, Plant Ceii 9: 21 19--2134, 
1997 

PCT/EP98/07761 


Att1712 


nematodes 


Bartheis et al. Plant Cell 9, 21 19-2134, 

1997 

PCT/EP98/07761 


Gst1 


Different types of 
pathogens 


Strittmatter et al, Moi Plant-Microbe Interact 
9:68-73, 1996 


LEMMi 


nematodes 


WO 92/21 757 - Piant Genetic Systems 


CLE 


geminivirus 


PCT/EP99/03445 - CiNESTAV 


PDF1.2 


Fungal inciuding 
Alt&mana brassicicola 
and Botrytis cinarea 


Manners et al, Plant Mol Biol, 38:1071- 
1080, 1998 


Thi2.1 


Fungal - Fusarium 
oxysporum f sp. 
matthiolae 


VIgnutelli et al, Plant J 14:285-295, 1 998 


DB#226 


nematodes 


Bird and Wilson, Mol Plant-Microbe Interact 
7:419-442, 1994 
WO 95.322888 


OB#280 


nematodes 


Bird and Wilson, Mol Plant-Microbe Interact 

7:419-442,1994 

WO 95.322888 


Cat2 


nematodes 


Niebel et al, Mol Plant-Microbe Interact 
8:371-378,1995 


□Tub 


nematodes 


Aristizabal et al (1996), 8'" International 
Congress on Plant-Microbe Interaction, 
Knoxville US B-29 


SHSP 


nematodes 


Fenoll et al (1997) In: Cellular and 
molecular aspects of plant-nematode 
interactions. Kluwer Academic, C. Fenoll, 
F.M.W. Grundler and S.A. Ohl (Eds.), 


Tsw12 


nematodes 


Fenoll et al (1997) In: Cellular and 
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molecular aspects of plant-nematode 
interactions. Kiuwer Academic, C. Fenoll, 
F.M.W. Grundier and S.A. Ohi (Eds.) 


Hs1(pro1) 


nematodes 


WO 98/122335 -Jung 


NsLTP 


viral, fungal, bacterial 


Molina and Garcia-Olmedo FEBS Lett, 
316:119-122,1993 


RIP 


viral, fungal 


Tumer et al, Proc Natl Acad Sci USA 
94:3866-3871, 1997 



Those skilled in the art will be aware of additional promoter sequences and terminator 
sequences which may be suitable for use in performing the invention. Such sequences 
may readily be used without any undue experimentation. 

5 In the context of the current invention, "ectopic expression" or "ectopic overexpression" 
of a gene or a protein are conferring to expression patterns and/or expression levels of 
said gene or protein normally not occurring under natural conditions. Ectopic expression 
can be achieved in a number of ways including operably linking of a coding sequence 
encoding said protein to an isolated homoiogous or heterologous promoter in order to 

10 create a chimeric gene and/or operably linking said coding sequence to its own isolated 
promoter (i.e. the unisolated promoter naturally driving expression of said protein) in 
'• ; order to create a recombinant gene duplication or gene muitiplication effect With 
"ectopic co-expression" is meant the ectopic expression or ectopic overexpression of two 
or more genes or proteins. The same or, more preferably, different promoters are used 

.15 to confer expression of said genes or proteins. 

Preferably, the promoter sequence used in the context of the present invention is 
operably linked to a coding sequence or open reading frame (ORF) encoding a LRR 
receptor-like kinase or a homolog, derivative and/or an immunologically active fragment 
thereof as defined supra. 

20 "Dominant negative version or variant" refers to a mutant protein, which interferes with 
the activity of the corresponding wild-type protein. 

"Downregulation of expression" as used herein means lowering levels of gene 
expression and/or levels of active gene product and/or levels of gene product activity. 
Decreases in expression may be accomplished by e.g. the addition of coding sequences 
25 or parts thereof in a sense orientation (if resulting in co-suppression) or in an antisense 
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orientation relative to a promoter sequence and furthermore by e.g. insertion 
mutagenesis (e.g. T-DNA insertion or transposon insertion) or by gene silencing 
strategies as described by e.g. Angeli and Baulcombe (1998 - WO9836083), Lowe et ai. 
(1989 - VVO9853083), Lederer et ai. (1999 - W09915682) or Wang et al. (1999 - 

5 WO9953050). Genetic constructs aimed at silencing gene expression may have the 
nucleotide sequence of said gene (or one or more parts thereof) contained therein in a 
sense and/or antisense orientation relative to the promoter sequence. Another method to 
downregulate gene expression comprises the use of ribozymes, e.g. as described in 
Atkins et al. 1994 (WO9400012), Lenee et al. 1995 (WO9503404), Lutziger et al. 2000 

10 (WO000061 9), Prinsen et al. 1 997 (W0971 3865) and Scott et ai. 1 997 (W097381 1 6). 
Moduiating, including lowering, the level of active gene products or of gene product 
activity can be achieved by administering or exposing cells, tissues, organs or organisms 
to said gene product, a homotog, anaiogue, derivative and/or immunologicaliy active 
fragment thereof. Immunomodulation is another example of a technique capable of 

15 downregulation ieveis of active gene product and/or of gene product activity and 
comprises administration of or exposing to or expressing antibodies to said gene product 
to or in cells, tissues, organs or organisms wherein ieveis of said gene product and/or 
gene product activity are to be modulated. Such antibodies comprise "plantibodies", 
single chain antibodies, IgG antibodies and heavy chain camel antibodies as well as 

20 fragments thereof. 

Modulating, including lowering, the level of active gene products or of gene product 
activity can futhermore be achieved by administering or exposing cells, tissues, organs 
or organisms to an inhibitor or activator of said gene product or the activity thereof. Such 
inhibitors or activators include proteins (comprising e.g. proteinases and kinases) and 

25 chemical compounds Identified according to the current invention as described supra. 
In the context of the invention the term "agonisf ' refers to a substance that can be either 
a protagonist or an antagonist. I.e. can have either positive or negative effects, can be 
an enhancer or an inhibitor or a modulator as well. 

In the context of the current invention is envisaged the downregulation of the expression 
30 of a LRR receptor-like kinase gene as defined higher. The invention further comprises 

downregulation of levels of a LRR receptor-like kinase or of a LRR receptor-like kinase 
activity whereby said LRR receptor-like kinase has been defined supra. 
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By "ceil fate and/or plant development and/or plant morphology and/or biochemistry 
and/or physiology" is meant that one or more developmental and/or morphological 
and/or biochemical and/or physiological characteristics of a plant Is altered by the 
performance of one or more steps pertaining to the invention described herein. 
5 "Ceil fate" refers to the cell-type or cellular characteristics of a particular cell that are 
produced during plant development or a cellular process therefor, in particular during the 
cell cycle or as a consequence of a ceil cycie process. 

"Plant deveiopmenf or the term "plant developmental characteristic" or similar term 
shall, when used herein, be taken to mean any cellular process of a plant that is involved 

10 in determining the deveiopmentai fate of a plant ceil, in particular the specific tissue or 
organ type into which a progenitor ceil will develop. Cellular processes relevant to plant 
development will be known to those skilied in the art. Such processes include, for 
example, morphogenesis, photomorphogenesis. shoot development, root development, 
vegetative development, reproductive development, stem elongation, flowering, and 

15 regulatory mechanisms involved in determining cell fate, in particular a process or 
regulatory process involving the cell cycle. 

"Plant morphology" or the term "plant morphological characteristic" or similar term will, 
when used herein, be understood by those skilled in the art to refer to the external 
appearance of a plant, including any one or more structural features or combination of 

20 Structural features thereof. Such structural features Include the shape, size, number, 
position, colour, texture, arrangement, and patlernation of any cell, tissue or organ or 
groups of cells, tissues or organs of a plant, Including the root, stem, leaf, shoot, petiole, 
trichome, flower, petal, stigma, style, stamen, pollen, ovule, seed, embryo, endosperm, 
seed coat, aleurone, fibre, fruit, cambium, wood, heart wood, parenchyma, aerenchyma, 

25 sieve element, phloem or vascular tissue, amongst others. 

"Plant biochemistry" or the term "plant biochemical characteristic" or similar term will, 
when used herein, be understood by those skilied in the art to refer to the metabolic and 
catalytic processes of a plant, including priman/ and secondary metabolism and the 
products thereof, including any small molecules, macromolecules or chemical 

30 compounds, such as but not limited to starches, sugars, proteins, peptides, enzymes, 
hormones, growth factors, nucleic acid molecules, celluloses, hemiceiluloses, calloses, 
lectins, fibres, pigments such as anthocyanins, vitamins, minerals, micronutrlents, or 
macronutrients, that are produced by plants. 
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■ Plant physiology" or the term "plant physiological characteristic" or similar term will, 
when used herein, be understood to refer to the functional processes of a plant, 
including developmental processes such as growth, expansion and differentiation, 
sexual development, sexual reproduction, seed set, seed development, grain filling, 

5 asexual reproduction, cell division, dormancy, germination, light adaptation, 
pr>otosynthesis, leaf expansion, fiber production, secondary growth or wood production, 
anxxigst others; responses of a plant to externally-applied factors such as metals, 
chocnscals, hormones, growth factors, environment and environmental stress factors (eg, 
arK)xia, hypoxia, high temperature, low temperature, dehydration, light, daylength, 

10 fVooc^, salt, heavy rnetais, amongst others), including adaptive responses of plants to 
said extemaily-applied factors. 

I.*eans lor introducing recombinant DNA into plant tissue or cells include, but are not 
bmrted lo. transformation using CaCIo and variations thereof, in particular the method 
doscnbod by Hanahan (1983), direct DNA uptake into protoplasts (Krens et al, 1982; 

n PasTkowski et a!, 1984), PEG-mediated uptake to protoplasts (Armstrong et al, 1990) 
mjcroparticle bombardment, electroporation (Fromm et al, 1 985), microinjection of DNA 
(Crossway et al., 1986), microparticle bombardment of tissue explants or cells (Christou 
€rf al. 1988), vacuum-infiltration of tissue with nucleic acid, or in the case of plants, T- 
DN A -mediated transfer from Agrobacterium to the plant tissue as described essentially 

20 by An et al.(1985), Dodds et at. (1985), Herrera-Estrella et al. (1983a, 1983b). Methods 
kst transformation of monocotyledonous plants are well known in the art and include 
A^ritacterium-medtatedi transformation (Cheng et al. 1997 - W09748814,- Hansen 1998 
- W09854961, Hiei et al. 1994 - WO9400977; Hiei et al. 1998 - W09817813; Rikiishi et 
^ 1999 - WO9904618; Saito et al. 1995 - WO9506722), microprojectile bombardment 

25 (Adams et al. 1999 - US5969213; Bowen et al. 1998 - US5736369; Chang et al. 1994 - 
W094 13822; Lundquist et al. 1999 - US5874265AJS5990390; Vasil and Vasil 1995 - 
USr>405765; Walker et al. 1999 - US5955362), DNA uptake (Eya! et al. 1993 - 
W093lBt68), microinjection of Agrobacterium cells (von Holt 1994 - DE4309203) and 
sonscation (Finer et al. 1997 - US5693512). 

30 Fof rrucroparticle bombardment of cells, a microparticle is propelled into a cell to produce 
a sransformed cell. Any suitable bailistic cell transformation methodology and apparatus 
can be used in performing the present invention. Exemplary apparatus and procedures 
are disclosed by Stomp et al. (U.S. Patent No. 5122466) and Sanford and Wolf (U.S. 
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Patent No. 4945050). When using ballistic transformation procedures, the gene 
construct may incorporate a plasmid capable of replicating in the cell to be transformed. 
Examples of microparticles suitable for use in such systems include 1 to 5 ^Jm gold 
spheres. The DNA construct may be deposited on the microparticle by any suitable 

5 technique, such as by precipitation. 

A whole plant may be regenerated from the transformed or transfected cell, in 
accordance with procedures weil known in the art. Plant tissue capable of subsequent 
clonal propagation, Vi/hether by organogenesis or embryogenesis, may be transformed 
with a gene construct of the present invention and a whole piant regenerated therefrom. 

10 The particular tissue chosen will vary depending on the clonal propagation systems 
available for, and best suited to, the particular species being transformed. Exemplary 
tissue targets include leaf dis[<s, pollen, embryos, cotyledons, hypocotyls, 
megagametophytes, callus tissue, existing meristematic tissue (e.g., apical meristem, 
axillary buds, and root meristems), and induced meristem tissue (e.g., cotyledon 

15 meristem and hypocotyl meristem). 

The term "organogenesis", as used herein, means a process by which shoots and roots 
are developed sequentially from meristematic centers. 

The term "embryogenesis", as used herein, means a process by which shoots and roots 
develop together in a concerted fashion (not sequentially), whether from somatic cells or 
20 gametes. 

Preferably, the plant is produced according to the inventive method is transfected or 
transformed with a genetic sequence, or amenable to the introduction of a protein, by 
any art-recognized means, such as microprojectiie bombardment, microinjection, 
Agrobacterium-mediaied transformation (including the 'flower dip' transformation 
25 method; Bechtoid and Pelietier 1998, Trieu et al. 2000), protopiast fusion, or 
electroporation, amongst others. Most preferably said plant is produced by 
/4grodacter/u/77-mediated transformation. 

The "seedling" is the juvenile plant that arises from the mature embryo after seed 
germination. 

30 With "differentiation of a cell" it is understood that the cell develops unique features to 
be engaged for a specific function. Mostly differentiation is irreversible. 
/Agrobactentvm-mediated transformation or agrolistic transformation of plants, yeast, 
moulds or filamentous fungi is based on the transfer of part of the transformation vector 
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sequences, called the T-DNA, to the nucleus and on integration of said T-DNA in the 

genome of said eukaryote. 

With "Agrobaoterium" is meant a member of the Agrobacteriaceae, more preferably 
Agrobacterium or Rhizobacterium and most preferably Agrobacterium tumefaclens. 
5 With "T-DNA", or transferred DNA, is meant that part of the transformation vector flanked 
by T-DNA borders which is, after activation of the Agrobacterium wr genes, nicked at the 
T-DNA borders and Is transferred as a single stranded DNA to the nucleus of an 
eukaryotic cell. 

When used herein, with "T-DNA borders", "T-DNA border region", or "border region" are 

10 meant either right T-DNA border (RB) or left T-DNA border (LB). Such a border 
comprises a core sequence flanked by a border inner region as part of the T-DNA 
flanking the border and/or a border outer region as part of the vector backbone flanking 
the border. The core sequences comprise 22 bp in case of octopine-type vectors and 25 
bp in case of nopaline-type vectors. The core sequences in the right border region and 

15 left border region form imperfect repeats. Border core sequences are indispensable for 
recognition and processing by the Agrobacterium nicking complex consisting of at least 
VirD1 and V!rD2. Core sequences flanking a T-DNA are sufficient to promote transfer of 
said T-DNA. However, efficiency of transformation using transformation vectors carrying 
said T-DNA solely flanked by said core sequences is low. Border inner and outer regions 

20 are known to modulate efficiency of T-DNA transfer {Wang et al. 1987). One element 
enhancing T-DNA transfer has been characterized and resides in the right border outer 
region and is called overdrive (Peralta et al. 1 986, van Haaren et al. 1 987). 
With "T-DNA transformation vector" or "T-DNA vector" is meant any vector 
encompassing a T-DNA sequence flanked by a right and left T-DNA border consisting of 

25 at least the right and left border core sequences, respectively, and used for 
transformation of any eukaryotic cell. 

With "T-DNA vector backbone sequence" or "T-DNA vector backbone sequences" is 
meant all DNA of a T-DNA containing vector that lies outside of the T-DNA borders and, 
more specifically, outside the nicking sites of the border core imperfect repeats. 
30 The current invention includes optimized T-DNA vectors such that vector backbone 
integration in the genome of a eukaryotic cell is minimized or absent. With "optimized T- 
DNA vector" is meant a T-DNA vector designed either to decrease or abolish transfer of 
vector backbone sequences to the genome of a eukaryotic cell. Such T-DNA vectors are 
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known to the one familiar with the art and Include those described by Hanson et al. 
( 1 999) and by Stuiver et al. (1 999 - WO9901 563). 

Tho current invention clearly considers the inclusion of a DNA sequence comprising the 
pronrwDter sequence of the present invention encoding a LRR receptor-iike kinase, 
5 homoiog, derivative or immunologically active fragment thereof as defined supra, in any 
T-DNA vector comprising binary transformation vectors, super-binary transformation 
vectors, co-integrate transformation vectors, Ri-derived transformation vectors as weit as 
in T-DNA carrying vectors used in agrolistic transformation. 

V/ith "binary transformation vector" is meant a T-DNA transformation vector comprising: 
1(1 a T-DNA region comprising at least one gene of Interest and/or at least one selectable 
marker active in the eukaryotic cell to be transformed; and 

a vector backbone region comprising at least origins of replication active in £. coli and 
Agrotacterium and markers for selection in E. coli and Agrobacterium. 
The T-DNA borders of a binary transformation vector can be derived from octopine-type 
15 or rxjpaline-type Ti plasmids or from both. The T-DNA of a binary vector is only 
iranslerred to a eukaryotic cell in conjunction with a helper plasmid. Also known in the art 
aro multiple binary vector Agrobacterium strains for efficient co-transformation of plants 
( B*dney and Scelonge 2000 - WO001 8939). 

VVrtti 'helper plasmid" Is meant a plasmid that is stably maintained in Agrobacterium and 
2<) »s at least carrying the set of vir genes necessary for enabling transfer of the T-DNA. 
Satd set of vir genes can be derived from either octopine-type or nopaline-type Ti 
plasmids or from both. 

Wrth 'super-binary transformation vector" is meant a binary transformation vector 
acl<*tK3nally carrying in the vector backbone region a vir region of the Ti plasmid 
pTiOo542 of the super-viruient A. tumefaciens strain A281 (Hiel et al. 1994 - 
EP06O4662, Hiei et al. 1995 - EP06a7730). Super-binary transformation vectors are 
used ?n conjunction with a helper plasmid. 

With 'co-integrate transformation vector" is meant a T-DNA vector at least comprising: 
a T DNA region comprising at least one gene of interest and/or at least one selectable 

marker active in plants; and 

a vector backbone region comprising at least origins of replication active in Eschericliia 
coll and Agrobacterium, and markers for selection in E. coli and Agrobacterium, and a 
sol of vir genes necessary for enabling transfer of the T-DNA. 
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The T-DNA borders and said set of vir genes of a said T-DNA vector can be derived 
from either octopine-type or nopaiine-type Ti piasmids or from both. 
With "Ri-derived plant transformation vector" is meant a binary transformation vector in 
which the T-DNA borders are derived from a Ti plasmid and said binary transformation 
5 vector being used in conjunction with a 'helper' Ri-plasmid carrying the necessary set of 
v/r genes. 

As used herein, the term "selectable marker gene" or "selectable marker" or "marker for 
selection" includes any gene which confers a phenotype on a cell in which it is 
expressed to facilitate the identification and/or selection of cells vtfhich are transfected or 

10 transformed with a gene construct of the invention or a derivative thereof. Suitable 
selectabie marker genes contemplated herein include the ampicillin resistance (Amp'), 
tetracycline resistance gene (To''), bacteria! kanamycin resistance gene {Kan'), 
phosphinothricin resistance gene, neomycin phosphotransferase gene (npfii), 
hygromycin resistance gene, p-giucuronidase (GUS) gene, chloramphenicol 

15 acetyitransf erase (CAT) gene, green fluorescent protein (gfp) gene (Haseloff et al, 
1997), and luciferase gene, amongst others. 

With "agrolistics", "agrolistic transformation" or "agrolistic transfer" is meant here a 
transformation method combining features of Agrobacterium-mediated transformation 
and of biolistic DNA delivery. As such, a T-DNA containing target plasmid is co-delivered 
20 with DNA/RNA enabling In planta production of VirD1 and VirD2 with or without VirE2 
{Hansen and Chilton 1996; Hansen et al. 1997; Hansen and Chilton 1997 - WO9712046). 
The present invention further describes an approach to remove from transfomned cells a 
stably integrated foreign DNA sequence by recombination involving a recombinase and 
recombination sites. 

25 With "foreign DNA" is meant any DNA sequence that is introduced in the host's genome 
by recombinant techniques. Said foreign DNA includes e.g. a T-DNA sequence or a part 
thereof such as the T-DNA sequence comprising the selectabie marker in an expressible 
format. Foreign DNA furthermore include intervening DNA sequences as defined supra. 

With "recombination event" is meant either a site-specific recombination event or a 
30 recombination event effected by transposon 'jumping'. 

With "recombinase" is meant either a site-specific recombinase or a iransposase. 

With "recombination site" is meant either site-specific recombination sites or transposon 

border sequences. 
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With "site specific recombination event" is meant an event catalyzed by a system 
generally consisting of three elements: a pair of DNA sequences (the site-specific 
recombination sequences or sites) and a specific enzyme (the site-specific 
recombinase). The site-specific reGombinase catalyzes a recombination reaction only 

5 between two site-specific recombination sequences depending on the orientation of the 
site-specific recombination sequences. Sequences intervening between two site-specific 
recombination sites will be inverted in the presence of the site-specific recombinase 
when the site-specific recombination sequences are oriented in opposite directions 
relative to one another (i.e. inverted repeats). If the site-specific recombination 

10 sequences are oriented in the same direction relative to one another (i.e. direct repeats), 
then any intervening sequences will be deleted upon Interaction with the site-specific 
recombinase. Thus, if the site-specific recombination sequences are present as direct 
repeats at both ends of a foreign DNA sequence integrated into a eukaryotic genome, 
such integration of said sequences can subsequently be reversed by Interaction of the 

15 site-specific recombination sequences with the corresponding site specific recombinase. 
A number of different site specific recombinase systems can be used including but not 
limited to the Cre/lox system of bacteriophage P1 , the FLP/FRT system of yeast, the Gin 
recombinase of phage Mu, the Pin recombinase of E. coli, the PinB, PinD and PinF from 
Shigella, and the R/RS system of Zygosaccharomyces rouxil. Recombinases generally 

20 are integrases, resolvases or flippases. Also dual-specific recombinases can be used in 
conjunction with direct or indirect repeats of two different site-specific recombination 
sites corresponding to the duai-specific recombinase (Baszczynski et al. 1999 - 
WO9925840). The preferred site-specific recombinase systems are the bacteriophage 
P1 Cre/lox, the yeast FLP/FRT and the Z rouxil R/RS systems. In these systems a 

25 recombinase (Cre, FLP or R) interact specifically with its respective site-specific 
recombination sequence (lox, FRT, or RS respectively) to invert or excise the intervening 
sequences. The site-specilic recombination sequences for each of these two systems 
are relatively short (34 bp for iox and 47 bp for FRT). Some of these systems have 
already been used with high efficiency in plants such as tobacco (Dale et ai. 1990, 

30 Onouchi et al. 1991, Sugita et al. 2000) and Arabidopsis (Osborne et ai. 1995, Onouchi 
et al. 1995). Site-specific recombination systems have many applications in plant 
molecular biology including methods for control of homologous recombination (e.g. 
Hodges et al. 1996 - US5527695), for targeted insertion, gene stacking, etc. 
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(Baszczynski et ai. 1 999 - W09925821 ) and for resolution of complex T-DNA integration 
patterns or for excision of a selectable marker (Ow et a!. 1 999 - WO9923202). 
Althougli the site-specific recombination sequences must be linked to the ends of the 
DNA to be excised or to be inverted, the gene encoding the site specific recombinase 

5 may be located elsewhere. For example, the recombinase gene could already be 
present in the eukaryote's DNA or could be supplied by a later introduced DNA fragment 
either introduced directly into cells, through crossing or through cross-pollination. 
Alternatively, a substantially purified recombinase protein could be introduced directly 
into the eukaryotic cell, e.g. by micro-injection or particle bombardment. Typically, the 

10 site-specific recombinase coding region will be operably linked to regulatory sequences 
enabling expression of the site-specific recombinase in the eukaryotic cell. 
With "recombination event effected by transposon jumping" or "transposase-mediated 
recombination" is meant a recombination event catalyzed by a system consisting of 
three eierrients: a pair of DNA sequences (the transposon border sequences) and a 

15 specific enzyme (the transposase). The transposase catalyzes a recombination reaction 
only between two transposon border sequences which are arranged as inverted repeats. 
A number of different transposon/transposase systems can be used including but not 
limited to the Ds/Ac system, the Spm system and the Mu system. These systems 
originate from corn but it has been shown that at least the Ds/Ac and the Spm system 

20 also function in other plants {Fedoroff et al. 1993, Sohiappi et al. 1993, Van Siuys et al. 
1987). Preferred are the Ds- and the Spm-type transposons which are delineated by 11 
bp- and 1 3 bp- border sequences, respectively. 

Although the transposon border sequences must be linked to the ends of the DNA to be 
excised, the gene encoding the transposase may be located elsewhere. For example, 
25 the recombinase gene could already be present in the eukaryote's DNA or could be 
supplied by a later introduced DNA fragment either introduced directly Into cells, through 
crossing or through cross-pollination. Alternatively, a substantially purified transposase 
protein could be introduced directly into cells, e.g. by microinjection or by particle 
bombardment. 

30 As part of the current invention, transposon border sequences are included in a foreign 

DNA sequence such that they lie outside said DNA sequence and transform said DNA 
into a transposon-like entity that can move by the action of a transposase. 
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As transposons often reintegrate at another iocus of the host" s genome, segregation of 
the progeny of the hosts in which the transposase was allowed to act might be 
necessary to separate transformed hosts containing e.g. only the transposon footprint 
and transfornfied hosts still containing the foreign DNA. 

5 In perfornriing the present invention, the genetic element is preferably induced to 
mobilize, such as, for example, by the expression of a recombinase protein in the ceil 
which contacts the integration site of the genetic element and facilitates a recombination 
event therein, excising the genetic element completely, or alternatively, leaving a 
"footprint", generaiiy of about 20 nucleotides in length or greater, at the original 

10 integration site. Those hosts and host parts that have been produced according to the 
inventive method can be identified by standard nucleic acid hybridization and/or 
amplification techniques to detect the presence of the mobilizable genetic element or a 
gene construct comprising the same. Alternatively, in the case of transformed host ceils, 
tissues, and hosts wherein the mobilizable genetic element has bene excised, it is 

15 possible to detect a footprint in the genome of the host which has been left following the 
excision event, using such techniques. As used herein, the term "footprint" shaff be taken 
to refer to any derivative of a mobilizable genetic element or gene construct comprising 
the same as described herein which is produced by excision, deletion or other removal 
of the mobilizable genetic element from the genome of a cell transformed previously with 

20 said gene construct, A footprint generally comprises at least a single copy of the 
recombination loci or transposon used to promote excision. However, a footprint may 
comprise additional sequences derived from the gene construct, for example nucleotide 
sequences derived from the left border sequence, right border sequence, origin of 
replication, recombinase-encoding or transposase-encoding sequence if used, or other 

25 vector-derived nucleotide sequences. Accordingly, a footprint is identifiable according to 
the nucleotide sequence of the recombination locus or transposon of the gene construct 
used, such as, for example, a sequence of nucleotides corresponding or complementary 
to a lox site, frt site or RS site. 

The term "cell cycle" means the cyclic biochemical and structural events associated with 
30 growth and with division of cells, and in particular with the regulation of the replication of 
DNA and mitosis. Cell cycle includes phases called: GO, Gap1 (G1), DNA synthesis (S), 
Gap2 (G2}, and mitosis (M). Normally these four phases occur sequentially, however, 
the cell cycle also includes modified cycles wherein one or more phases are absent 
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resulting in modified cell cycle such as endomitosis, acytokinesis, polyploidy, polyteny, 

and endoreduplicatlon. 

The term "cell cycle interacting protein", "ceil cycie protein" or "ceil cycle control protein" 
as denoted herein means a protein which exerts control on or regulates or is required for 
5 the cell cycle or part thereof of a cell, tissue, organ or whole organism and/or DNA 
replication. It may also be capable of binding to, regulating or being regulated by cyclln 
dependent kinases or their subunits. The term also includes peptides, polypeptides, 
fragments, variant, homologs, alleles or precursors (eg preproproteins or preproteins) 
thereof. 

10 Cell cycle control proteins and their role in regulating the cell cycle of eukaryotic 
organisms are reviewed in detail by John (1981) and the contributing papers therein 
(Norbury & Nurse 1 992;Nurse 1 990;Ormrod & Francis 1 993) and the contributing papers 
therein (Doerner et al. 1996) which are incorporated by reference.(EI!edge 1996;Francis, 
Francis & Halford 1 995;Hirt et al. 1991 ;Mironov et al. 1999) 

15 The term "cell cycle control genes" refers to any gene or mutant thereof which exerts 
control on or are required for: chromosomal DNA synthesis and for mitosis (preprophase 
band, nuclear envelope, spindle formation, chromosome condensation, chromosome 
segregation, formation of new nuclei, formation of phragmoplast, duplicatior^ of 
microtubuie-organizing center, etc) meiosis, cytokinesis, cell growth, endoreduplication, 

20 cell cycie control genes are also all genes exerting control on the above: homologs of 
CDKs, cyclins, E2Fs, Rb, CKI, Cks, and also any genes which interfere with the above, 
cyclin D, cdc25, Weel, Nimi, MAP kinases, etc. 

More specifically, ceil cycle control genes are ail genes involved in the control of entry 
and progression through S phase. They include, not exclusively, genes expressing "cell 
25 cycle control proteins" such as cyclln dependent kinases (CDK), cyclin dependent kinase 
inhibitors (CKI), D, E and A cyclins, E2F and DP transcription factors, pocket proteins, 
CDC7/DBF4 kinase, CDC6, MCM2-7, Ore proteins, cdc45, components of SCF ubiquitin 
ligase, PCNA, DNA-polymerase. 

The term "cell cycle control protein" include cyclins A, B, C, D and E including CYCA1 ;1 , 
30 CYCA2;1 , CYCA3;1, CYCB1;1, CYCB1;2, CYC B2;2, CYCD1;1, CYCD2;1, CYCD3;1, 
and CYCD4;1 (Evans et al. 1983;Francis et al. 1998;Labbe et al. 1989;Murray & 
Kirschner 1989;Renaudin et al. 1996;Soni et al. 1995;Sorreil et al. 1999;Swenson, 
Farrell, & Ruderman 1986) cyclin dependent kinase inhibitor (CKI) proteins such as ICK1 
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(Wang, Fowke, & Crosby 1997), FL39, FL66, FL67 (PCT/EP98/05895), Slc1, Far1, 
Rum1, p21, p27, p57, p16, p15, p18, p19 (Eliedge 1996;Pines 1995), p14 and p14ARF; 
p13suc1 or CKSIAt (De VeySder et a!. l997;Hayles & Nurse 1986) and nim-l (Fantes 
1989;Russell & Nurse 1986;Russeil & Nurse 1987b;Russe!l & Nurse 1987a;Rusself & 

5 Nurse 1986) honnoiogues of Cdc2 such as Cdc2MsB (Hirt et al. 1993) CdcMs kinase 
(Bogre et al. 1997) cdc2 T14Y15 phosphatases such as Cdc25 protein phosphatase or 
p80cdc25 (Bell et al. 1993;Eiledge 1996;Kumagai & Dunphy 1991;Russeii et al. 1986) 
and Pyp3 (Eliedge 1996) cdc2 protein kinase or p34cdc2 (Colasanti, Tyers, & 
Sundaresan 1991;FeiCer & Jacobs 1990;Hirt et al. 19gi;John, Sek, & Lee 1989",Lee & 

10 Nurse 1987;Nurse & Bissett 1981;Ormrod et al. 1993) cdc2a protein kinase (Hemerly et 
al. 1993) cdc2 T14Y15 kinases such as wee1 or p107wee1 (Eliedge 1996;Russell et al. 
1986;Russell et al. 1 987b;Russell et al. 1987a;Sun et al. 1999) rpik1 (Lundgren et al. 
1991) and mytl (Eliedge 1996); cdc2 T161 kinases such as Cak and Civ (Eliedge 1996); 
cdc2 T161 phosphatases such as Kap1 (Eliedge 1996); cdc28 protein kinase or 

15 p34cdc28 (Nasmyth 1993;Reed, Hadwiger, & Lorinoz 1985) p40MO15 (Fesquet et al. 
1993;Poon et al. 1993) chk1 kinase (Zeng et al. 1998) cds1 kinase (Zeng et al. 1998) 
growth-associated HI kinase (Labbe et al. 1989;Lake & Salzman 1972;Langan 
1978;Zeng et al. 1998) MAP kinases described by (Binarova et al. 1998;B6gre et al. 
1999;Calderini et al. 1998;Wiison et a!. 1999). 

20 Other cell cycle control proteins that are involved in cyciin D-mediated entry of cells into 
G1 from GO include pRb (Huntley et al. 1998;Xie et al. 1996) E2F, RIP, MCM7 and 
potentially the pRb-iIke proteins p107 and pi 30. 

Other cell cycle control proteins that are involved in the formation of a pre-replicative 
complex at one or more origins of replication, such as, but not limited to, ORG, CDC6, 

25 CDC1 4, RPA and MCM proteins or in the regulation of formation of this pre-replicative 
complex, such as, but not limited to, the CDC7, DBF4 and MBF proteins. 
For the present purpose, the term "cell cycle control protein" shall further be taken to 
include any one or more of those proteins that are Involved in the turnover of any other 
cell cycle control protein, or in regulating the half-life of said other cell cycle control 

30 protein. The term "protein turnover" is to include all biochemical modifications of a 
protein leading to the physical or functional removal of said protein. Although not limited 
to these, examples of such modifications are phosphorylation, ubiquitination and 
proteolysis. Particularly preferred proteins which are involved in the proteolysis of one or 
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more of any other of the above-mentioned cell cycle control proteins include the yeast- 
derived and animal-derived proteins, Skp1 , Skp2, Rub1 , Cdc20, cuilins, CDC23. CDC27, 
CDC16, and plant-derived homologues thereof (Cohen-Fix & Koshland 
1997;Hochstrasser 1998;Krek 1998;Lisztwan et al. 1998) and Please et al In (Francis et 

5 aL 1998)). 

For the present purpose, the term "cell cycle control genes" shall further be taken to 
include any one or more of those gene that are involved in the transcriptional regulation 
of cell cycle control gene expression such as transcription factors and upstream signal 
proteins. Additional ceil cycle control genes are not excluded. 

10 For the present purpose, the term "cell cycle control genes" shall further be taken to 
include any cell cycle control gene or mutant thereof, which is affected by environmental 
signals such as for instance stress, nutrients, pathogens, or by intrinsic signals such as 
the animal mitogens or the plant hormones (auxins, cytokinins, ethylene, gibberelllc acid, 
abscisic acid and brassinosteroids). 

15 The tenm "cell cycle progression" refers to the process of passing through the different 
cell cycle phases. The term "cell cycle progression rate" accordingly refers to the speed 
at which said cell cycle phases are run through or the time spans required to complete 
said cell cycle phases. 

With "pathogen" is meant those organisms that have a negative effect on the 
20 physiological state of the plant or a part thereof. Some pathogens are for instance 
viruses, bacteria,, fungi, and parasitic plants, with plant "pests" is meant the group of 
nematodes as well as insects, which are able to attack and to damage the plant. 
"Plant cell" comprises any ceil derived from any plant and existing in culture as a single 
cell, a group of cells or a callus. A plant ceil may also be any ceil in a developing or 
25 mature plant in culture or growing in nature. 

"Plants" comprises all plants, including monocotyledonous and dicotyledonous plants. 
With "yeast two-hybrid assay" is meant an assay that is based on the obsen/ation that 
many eukaryotic transcription factors comprise two domains, a DNA-blnding domain 
(DB) and an activation domain (AD) which, when physically separated (i.e. disruption of 
30 the covalent linkage) do not effectuate target gene expression. Two proteins able to 
interact physically with one of said proteins fused to DB and the other of said proteins 
fused to AD will re-unite the DB and AD domains of the transcription factor resulting in 
target gene expression. The target gene in the yeast two-hybrid assay is usually a 
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reporter gene such as the p-galactosldase gene. Interaction between protein partners in 
tlie yeast two-hybrid assay can thus be quantified by measuring the activity of the 
reporter gene product (Bartel and Fields 1997). Alternatively, a mammalian two-hybrid 
system can be used which includes e.g. a chimeric green fluorescent protein encoding 

5 reporter gene (Shioda et al. 2000). Yet another alternative consists of a bacterial two- 
hybrid system using e.g. HIS as reporter gene (Joung et al. 2000). 
The term "fragment of a sequence" or "part of a sequence" means a truncated sequence 
of the original sequence referred to. The truncated sequence (nucleic acid or protein 
sequence) can vary widely in length; the minimum size being a sequence of sufficient 

10 size to provide a sequence with at least a comparable function and/or activity or the 
original sequence referred to, vMle the maximum size is not critical. In some 
applications, the maximum size usually is not substantially greater than that required to 
provide the desired activity and/or function{s) of the original sequence. Typicaily, the 
truncated amino acid or nucleotide sequence will range from about 5 to about 60 amino 

15 acids in length. More typically, however, the sequence will be a maximum of about 50 
amino acids in lenght, preferably a maximum of about 60 amino acids. It is usually 
desirable to select sequences of at least about 10, 12 or 15 amino acids or nucleotides, 
up to a maximum of about 20 or 25 amino acids or nucleotides. 

Furthermore, folding simulations and computer redesign of structural motifs of the 
20 protein of the invention can be performed using appropriate computer programs 
(Olszewski et al.1996, Hoffman et al.1995). Computer modeling of protein folding can be 
used for the conformational and energetic analysis of detailed peptide and protein 
models (Monge et al. 1995, Renouf et al. 1995). In particular, the appropriate programs 
can be used for the identification of Interactive sites of the LRR receptor-like kinase of 
2.5 the present invention by computer assistant searches for complementary peptide 
sequences (Fassina and Melii 1 994). Further appropriate computer systems for the 
design of protein and peptides are described in the prior art, for example in Berry and 
Brenner (1994), Wodak (1987), Pabo and Suchanek (1986). The results obtained form 
the above-described computer analysis can be used for, e.g. the preparation of 
30 peptidomlmetics of the protein of the invention or fragments thereof. Such 
pseudopeptide analogues of the natural amino acid sequence of the protein may very 
efficiently mimic the parent protein (Benkirane et al. 1 996). For example, incorporation of 
easily available achira! (o-amino acid residues into a protein of the invention or a 
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t-agment thereof results in the substitution of amino bonds by polymethyiene units of an 
ahptxatlc chain, thereby providing a convenient strategy for constructing a peptidomimetic 
(E^inerjee et al. 1996). Superactive peptidomimetic analogues of small peptide 
Mormones in other systems are described in the prior art (Zhang et al. 1996). Appropriate 
peptidomimetics of the protein of the present invention can also be identified by the 
synthesis of peptidomimetic combinatorial libraries through successive amine alkylation 
and testing the resulting compounds, e.g., for their binding, kinase inhibitory and/or 
mmunlogical properties. Methods for the generation and use of peptidomimetic 
combinatioral libraries are described in the prior art, for example in Ostresh et al. (1996) 
and Domer et al. (1 996). 

furthermore, a three-dimensional and/or crystaiSographic structure of the protein of the 
nventwn can be used for the design of peptidomimetic inhibitors of the biological activity 
c4 !ho protein of the invention (Rose et a!. 1996, Ruterber et a!. 1996). 
To akgn sequences we used the Ciustal V method described in Higgins and Sharp 
ihttcjgsns DG & Sharp PM, Comput AppI BioscL 1989 Apr;5(2)151-3). The Ciustal Method 
groups sequences into clusters by examining sequence distances between alt pairs. 
Ousters are aligned as pairs then collectively as sequence groups to produce the overall 
alignment. After the multiple alignment is completed, a neighbor-joining method is 
©mptoyed to reconstruct phylogeny for the putative alignment. 
1 . Residue weight table 

The Ciustal method uses weight tables to construct multiple alignments. Residue 
weight tables are used in scoring protein and nucleotide alignments so that 
slightly mismatched residues, such as lie vs. Val amino acids or G vs. R 
nucleotides, are not scored the same as total mismatches, 
if your sequences are protein, or a mixture of protein and DNA, choose between 
PAM250, PAM100, Structural and Identity. 

1. iPAM250 is recommended for the Ciustal method and is ideal for longer 
sequences or highly diverged sequences. 

2. PAM100 is also used with the Ciustal method and is most accurate for 
shorter sequences or sequences having little divergence. 

3. iStructura! is recommended for the Jotun Main method and gives higher 
scores to residues that are chemically and spatially similar. 
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4. Identity may be used with Clustal or Jotun Hein methods, but is 
recommended only for the Clustal method. The table scores non-identities as 
zero, and identities as a default value of 1 0. 

2. Method Parameters 

A. Pairwise alignment parameters may include K-tuple, Gap Penalty, Window 
and Diagonals Saved: 

1 . Specify the K-tuple, expressed as the unit's length in nucleotides. Any stretch 
of K or more adjacent nucleotides that form an exact match in both 
sequences is a K-tuple. A lower K-tupie will find smaller matching regions; a 
higher value will find fewer, but better, matching regions of similarity. 

2. Type in the Gap Penalty, the amount deducted from the score for each gap in 
the alignment. Gaps of different sizes carry the same penalty. 

3. Specify the Window, the range the program searches for all possible or 
aliernate K-tup!es. An extremely high value rnay slow an alignment by 
considering subalignments far from the main diagonal. 

4. Type in the number of Diagonals Saved. In the painMse calculalion, the best 
scoring diagonals for each pair in the matrix are saved and used in the 
clustering and scoring of ancestors. 

B. Multiple aHgnment parameters include Gap Penalty and Gap Length Penalty. 

1 , Specify the Gap Penally, the amount deducted from the score lor each gap in 
the alignment. Gaps of different sizes carry the same penalty. 

2. Specify the Gap Length Penalty, the value which will be deducted from the 
alignment score after first multipiying it by the number of gaps. Longer gaps 
have a greater penalty than shorter gaps. 

Whenever in the current description reference is made to a "phylogenetic tree", this 
phylogenetic tree is used to view phylogenetic relationships compatible with multiple 
sequence alignments. The length of each pair of branches represents the distance 
between sequence pairs, while the units at the bottom of the tree indicate the number of 
substitution events. 
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The phyiogenetic tree gives a first estimate of the relationships between homologous 
sequences. It Is highly recommended that your data be evaluated using additional 
methods before finai conclusions are made about evolutionapy relationships. 
You can switch the view between Balanced Branches (phenogram) or Unbalanced 

5 Branches (cladogram). A phenogram averages the distances between ancestors in the 
tree. Dotted lines indicate a negative branch length, which may be a byproduct of 
averaging. In the cladogram, branch distances correspond to sequence divergence. 
The compounds to be obtained or identified in the methods of the invention can be 
compounds that are able to bind to any of the nucleic acids, peptides or proteins of the 

10 invention. Other interesting compounds to be identified are compounds that modulate 
the expression of the genes or the proteins of the invention in such a way that either the 
expression of said gene or protein is enhanced or decreased by the action of said 
compound. Alternatively the compound can exert his action by directly or indirectly 
enhancing or decreasing the activity of any of the proteins of the invention. 

15 Said compound or plurality of compounds may be comprised in, for example, samples, 
e.g., cell extracts from, e.g., piants, animals or microorganisms. Furthermore, said 
compound(s) may be known in the art but hitherto not known to be capable of 
suppressing or activating eel! cycie interacting proteins. The reaction mixture may be a 
cell free extract of may comprise a cell or tissue culture. Suitable set ups for the method 

20 of the invention are known to the person skilled in the art and are, for example, generally 
described in Alberts et al.. Molecular Biology of the Cell, third edition (1994), in particular 
Chapter 17. The plurality of compounds may be, e.g., added to the reaction mixture, 
culture medium or injected into the cell. 

If a sample containing a compound or a plurality of compounds is identified in the 
25 method of the invention, then it is either possible to isolate the compound form the 
original sample identified as containing the compound capable of acting as an agonist, 
or one can further subdivide the original sample, for example, if it consists of a plurality 
of different compounds, so as to reduce the number of different substances per sample 
and repeat the method with the subdivisions of the original sample. Depending on the 
30 complexity of the samples, the steps described above can be performed several times, 
preferably until the sample identified according to the method of the invention only 
comprises a limited number of or only one substance(s). Preferably said sample 
comprises substances or similar chemical and/or physical properties, and most 
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preferably said substances are identical. Preferably, the compound identified according 
to the above described method or its derivative is further formulated in a form suitable for 
the application in plant breeding or plant cell and tissue culture. 

The invention, now being generally described, will be more readily understood by 
5 reference to the following examples, which are included merely for purposes of 
illustration of certain aspects and embodiments of the present invention and are not 
intended to limit the invention. 
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DESCRIPTION OF FIGURES 

Figure 1 . Models for stem-cell maintenance in the root and shoot meristem. 

(a) In the root meristem, the quiescent center (QC) functions as an organizing center. 

5 Those cells in immediate contact with the QC are stem cells (dark shading), which are 
initials for al! cell files: endodermis (end), cortex (cort), epidermis (epi), lateral root cap 
(Ire) and columella (co!). (b) In the shoot meristem, WUS is functionally equivalent to the 
QC in the root meristem in maintaining the presence of stem cells (dark shading). Long- 
range signaling for stem cell m.aintenance is also postulated for ZLL, which is expressed 

10 in the vascular cells. CLV3 is expressed in the presumptive stem ceils that the initials for 
the L1 (epidermis), L2 (subepidermal tissues, i.e. mesophyil cells) and L3 (subepidermal 
tissues, i.e. vascular ceils) tissue layers (Schnittger, Schellmann, & Hulskamp 1 999). 
CLV1 is found in the L3 tunica layer of the central zone. 

15 Figure 2. List of the degenerative primers used for RT-PCR of root-specific members of 
the LRR receptor kinase family. 

Figure 3. Differential RT-PCR on hypocotyl/cotyledon (C) and roottip (R) RNA. 

The PGR fragments obtained upon amplification of R and 0 cDNAs with degenerate 
20 primers are separated on agarose gel. DNA fragments of different size can be detected 
in the R22 compared to C22 PCRs. 

M represents the 1 00 bp marker (Boehringer). C and R indicated the origin of th$ cDNA 
sample used in the PGR and the numbers below indicate the primer combination used, 
e.g. G23 represents hypocotyl/cotyledon cDNA amplified with primers LRR2 and KINS. 

25 

Figure 4. RT - PCR analysis of RCH1 expression. 

Total RNA was isolated from flower (F), hypocotyl/cotyledon (C) and rooitips (R) and 
RCH1 mRNA was amplified for 26 cycles with RT-PCR using the RCH1F and RCH1R 
primers. As a control for cDNA quality and quantity as well as relative expression level, 
30 ubiquitin (Ubi) mRNA was amplified for 20 cycles under the same conditions using the 
UBIF and UBIR primers. 
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Figure 5. The nucleotide sequence of the RCH1 operon. 

This figure shows het nucleotide sequence of SEQ ID NO 1 that contains the genomic 
RCH1 clone. 

5 

Figure 6, The siucleotlde sequence of the RCH1 promoter region. 
This fragment of the genomic RCH1 clone is about 3498 base pairs and is identified as 
SEQ ID NO 2. Two putative ATG start codons of the RCH1 protein on position 3479- 
3481 and on position 3491-3493 couid be identified and are represented in bold. The 

10 underlined sequence corresponds to SEQ ID NO 1 8. 

It is aiso known that there is a gene in the reverse orientation in this promoter region. 
This gene's open reading frame starts at the position 1759-1757 with the ATG (bold), 
contains two introns, and ends at position 350-348 with a TGA codon (bold). A homology 
search of the protein sequence of this antisense gene in the RCHIprom against the 

15 protein database with the program BLAST? 2.0.8 (Altschul et al. 1997) produced a 
hypothetical protein (Accession number AC002392, clone T20K24.8) that is a putative 
anthranilate N-hydroxycinnamoyl/benzoyltransf erase from Arabidopsis thaliana. Both 
sequences show 38% identity and 54% of their amino acids are homologous. 

20 Figure 7. Promoter activity of the RCH1 promoter - GUS fusion construct in tlie 
Arabidopsis thaliana roots. 

Seedlings of different age were grown and stained with Glucuronidase (GUS). The 
expression of the GUS reporter gene can be seen in the root meristem of the main and 
the lateral roots, as well as in the vascular tissue of the root and the lateral root 

25 primordia. (A) 1 day old seeding stained over p.ight, (B) higher magnification of A, (C) 2 
days old seedling stained for 5 hours, (D) main root from 1 day old seedling stained for 1 
hour, (E) lateral root primordia from 1 week old seedling stained for 5 hours, (F, G, H) 
lateral roots from an 8 days old seedling stained for 5 hours. 

30 Figure 8. The nucleotide sequence of the RCH1 gene. 

This fragment of the genomic RCH1 clone Is about 3960 base pairs long and is identified 
as SEQ ID NO 3. Two putative ATG start codons of the RCH1 protein on position 1-3 
and on position 13-15 could be identified and are represented in bold. Also the TAA 
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stopcodon on position 3958-3960 is represented in bold. There is a putative intron 
estimated to range from position 2900 (CAGaGTA) to position 3452 (CAGaAAT) with the 
program http://www.CBS.DTU.DK/services/NetGene2 . The putative splice-sites are 
underlined. 

5 

Figure 9. The amino acid sequence of the open reading frame encoding RCH1 
protein. 

Two putative start methionins of the RCH1 protein on position 1 and on position 4 could 
be identified. An ORF of 1 1 35 amino acids is identified and represented in SEQ ID NO 4. 
10 The ORF of 1131 amino acids and starting at the methionine on position 4 (0) 
corresponds with SEQ ID NO 19. 

The boxed peptide |RYKI I LG AAQGLAYLH H DCVPPI VH| corresponds with SEQ ID NO 5. 

Figure 10. Amino acid alignment of the RCH1 protein the Arabidopsis thaliana 
1 5 CLV1 receptor kinase lilce protein. 

The amino acid sequences of the RCH1 and of CLV1 (Genbank accession number 
AAB58929) were aligned using the program GAP as described in example 3, Match 
display thresholds for the alignment(s) are: | = IDENTITY, : = 2, . = 1 . 

20 Figure 1 1 . Phylogenetic tree of CLV1 homologues. 

The novel RCH1 sequence was aligned with the CLV1 (Genbank accession number 
U96879.1), the known CLV1 homologues BRI1 (Genbank accession number 
AF0 17056), HAE (GenPept accession number 827756) and ER (Genbank accession 
number D83257) as weii as with 4 other related proteins that vi/ere retrieved from the 
25 public database searched with the RCH1 sequence using the BiastP program. The 
multiple alignment and phylogenetic tree of said sequences were calculated with the 
Clustal V method as described in example 4. Based on the sequence, RCH1 is more 
homologue to CLV1 than other known CLV-homologues. 

30 Figure 12. Phylogenetic tree of LRR receptor like kinases 

The novel RCH1 sequence was aligned with GLV1, its homologues BRI, HAE, ER and 
other LRR receptor like kinases retrieved from the public database. Also plant disease 
resistance genes, which comprise a leucine-rich repeat and which are receptor-like were 
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included in the alignment (RPS5, RPS2, RPP8,RPM1, RPS4, N, RPP5, Meyers ef a/., 
1999, Plant J;, Nov20(3): 31 7-32). The multiple alignment and phylogenetic tree of said 
proteins were calculated with the Clustal V method as described in example 4. Because 
of the close proximity of RCH1 to CLV1 in the phylogenetic tree, there is more 
5 evolutionary relationship between RCH1 and clavata, than with other kinds of LRR 
receptor like proteins such as plant disease resistance genes. 

Figure 13. The nucleotide sequence of a minima! promoter sequersce of RCH1 
This figure shows the nucleotide sequence of SEQ ID NO. 18 that contains a minima! 
10 promoter sequence of the RCHI clone. This fragment of the genomic RCH1 clone is 
about 842 base pairs. This sequence is deducted from the iarge promoter region 
identified as SEQ ID NO 2 (see example 7). The minimal promoter region correlates with 
the base pairs ranging from -1 of the first ATG defined in SEQ ID NO. 2 to -842 of the 
first ATG defined in SEQ ID NO. 2. 

15 

Figure 14. Outprint of GenBank accession numi}er AB017061 on December 4, 
2000. 

Figure 15. RCIHI1prom::GUS expression in embryo's 

20 Arabidopsis embryo transformed with RCH1prom::GUS. Embryo's were stained with 
beta glucuronidase. The RCHI promoter is active very early in plant development and Is 
only active in the root in the embryonic phase (A). On a stronger magnification (B) it is 
visible that the root specific expression Is limited to the meristematic zone of the 
embryonic root. 

25 

Figure 16. RCH1"GFP expression In roots 

RCHI promoter is strongly active in endodermis, cortex, epidermis, lateral root cap, in 
short: in the "division zone". The RCHI promoter is also active but in a low manner, in 
the quiescent center and in the vascular tissue. 

30 

Figure 17. Amino acid sequence of RCH1 homoiogues of Table 4 
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EXAMPLES 

Unless stated otherwise in the Examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambroolt et al. (1989), Molecular 
5 Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, NY or in Volumes 
1 and 2 of Ausubel et al., (1994), Current Protocols in Molecular Biology, Current 
Protocols. Standard materials and methods for plant molecular work are described in 
Plant Molecular Biology Labfase (1993) by R.D.D. Croy, jointly published by BIOS 
Scientific Publications Ltd. (UK) and Blackwell Scientific Publications (UK). 

10 

Example 1 : Isolation of the RCH1 operon from root tissue of Arabidopsis thaliana . 

Plant organ development depends on the presence of distally positioned groups of 
continuousiy dividing cells, the shoot and root apical meristems. Laser ablation studies 
suggest that a baiance between signals for proper differentiation and short range signals 

15 from the mitotically inactive quiescent centre (QC), required to keep ceils less 
differentiated, is important to maintain the root meristem (van den Berg et al 1995;van 
den Berg et al. 1997), In the shoot meristem a balance between the WUSCHEL and 
CLAVATA (CLV) genes has been implicated to piay an essential role in regulation cell 
division and differentiation (Clark et al. 1993; Fletcher et al. 1999;Schoof et al. 2000). 

20 The CLV1 gene encodes a leucine rich repeat (LRR) receptor kinase (Clark, Williams, & 
Meyerowitz 1997), a member of a large gene family (Lease, Ingham, & Walker 1998). 
Other members are ERECTA {ER) (Torii et al. 1996), BRASSINOLIDE INSENSITVE 
[BRl) (Li et al. 1997) and HAESA (H4£) (Jinn et al. 2000), genes which have a function 
in plant development 

25 We investigated whether root specific CLV1 homologs exist using a differential RT-PCR 
based approach. PGR products were generated from root tip and hypocotyl/cotyledon 
cDNA pools using degenerate primers, resulting in the isolation of the root specific 
ROOT CLAVATA HOMOLOG 1 {RCH1). The promoter region of this gene was 

translationally fused to P-glucoronidase-greepi fluorescent protein {GUS/GFP) conferring 
30 root specific expression of these reporter genes (see Example 2). 
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Designing degenerate primers 

To RT-PCR root specific members of the LRR receptor kinase gene family we compared 
the sequences of the CLV1, ER and HAE genes and their encoded proteins. The 
degenerate LRR primers were designed against the NxLxGxiP encoding region of the 
5 xLxxNxLxGxIPxxLxxLxxLxxL consensus for the LRR receptor-iii<e kinases. Degenerate 
kinase primers were disigned for different homologous regions of the kinase domain. 
The primers are listed in Figure 2. 

RNA Isolation 

10 For each RNA isolation the roottips and the hypocotyl/cotyledon part including the shoot 
apical meristem of Arabidopsis thaliana (ColO) plants, 4 days after germination, were 
collected. 

Total RNA was isolated (Pawiowski et al. 1994) and chromosomal DNA contamination 
was removed upon Dnase ! (Promega, l\yiadison, Wl) treatment, following the instructions 
15 of the suppliers. The amount and quality of RNA was determined using 
spectrophotometer and agarose gel electrophoresis (data not shown). 

Differential RT-PCR 

Reverse transcription (RT) was performed using 5 fig of root tip (R) and 
20 hypocotyi/cotyledon (C) total RNA in a 20 reaction volume with 0.5 \ig oligodTi2.,B 
(Pharmacia, New Jersey, USA) and 200 U Superscript 11 (LifeTechnoiogies, Paisley, 
UK), according to the instructions of the supplier. Following first strand cDNA synthesis 
the cDNA samples were treated with RNase H (LifeTechnoiogies) as instructed by the 
supplier and subsequently diluted to a total volume of 1 00 fxl with water. 
25 Amplification reactions were performed with all degenerate LRR anc KIN d nmer pairs 
(Figure 2). For the polymerase chain reaction (PGR) 2.5 |a.l of the \ or C cDNA sample 
was amplified in a total volume of 50 \i\ reaction volume contain-.' ., 100 \M dNTPs, 100 
ng LRR and KIN primer, 1 U Taq polymerase (Boehringer, Inge^ ;im, Germany) and its 
accompanying buffer. The cDNAs were amplified during 40 eye (1 min at 94°C, 1 min 
30 at 45-55°C, 3 min at 72°C). 
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Cloning and sequencing 

The amplified R and C cDNAs were separated and compared witii agarose gel 
electrophoresis (Figure 3). The root specific fragments, e.g. fragments present in R22 and 
not in C22, were isolated from gei, cioned into the pGEM-T vector (Promega) and 

5 transformed into E. coil (strain DH5a), using standard cloning procedures. 

Several PGR fragments of the same size but from different members of the LRR 
receptor kinase gene famiiy may be generated using degenerate primer pairs. Therefore, 
24 randomly picked coionies obtained after transformation were used in a "colony PGR". 
Aliquots of the amplified inserts were separated on agarose get followed by Southern 

10 transfer to a Supercharge N+ membrane (Schleicher & Schuell, Dassel, Germany) to 
create a "colony blof . An aliquot of one insert was labeled with ^^P-dCTP and hybridized 
to the colony blot to determine the number of clones it represented in the poo! of 24. This 
procedure was repeated with other labeled inserts until all clones were identified. 
A single representative of each pool of clones was sequenced and the sequence was 

15 analyzed using the BU\ST program. One of the clones sequenced was named RCH1 
and showed strong homology to plant LRR receptor kinases. 

RT-PCR 

The relative expression levels of RCH1 in root tip (R), hypocotyl/cotyledon (C) and 
20 flowers (F) were determined using RT-PCR. Primers specific for the RCH1 sequence 

were designed (RCH1F and RCH1R, Figure 2). The PGR sampies were analyzed with 
agarose gei electrophoresis. Ubiquitin {Ubi) mRNA was used as an internal control 
(Hoa'ath et ai. 1993). Polyubiquitin genes consist of multiple units (Callis et al. 1995) and 
the Ubi primers hybridize with the ends of each single unit. 
25 independently Isolated total RNA v-/as reverse transcribed to obtain cDNA and for the 
PGR reaction 2 fx! of the cDNA sample was amplified in a total volume of 50 |x) as 
described abovefor the differential RT-PCR. The RCH1 and Ubi cDNAs were amplified 
during 26 or 20 cycles (30 sec at 94°C, 30 sec at 56°C, 30 sec at 72°C), respectively 
(Figure 4). RCH1 expression was only observed in root tip (R). 

30 

Example 2 : Promoter isolation and promoter fusion construct 

To determine if the RCH1 promoter region is sufficient to confer root specific expression 
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of a "gene of interest" we decided to fuse this promoter to the Glucuronidase/Green 
Fluorescent Protein (GUS/GFP) reporter gene. The RCH1 clone was used to isolate a 
phage containing the corresponding genomic region from a XGEiVi (Promega, Madison, 
Wl) based genomic library. The X-GEM11 phage containing an insert of about 14550 

5 nucieotides comprising the Arabidopsis ihaliana RCH1 promoter and RCH1 gene and 
also the surrounding genomic sequences was deposited on October 25, 2000 at the 
BCCM-LMBP plasmid collection and was named "lambda phage RCH1 genomic". This 
deposit was given the accession number LMBP 5582CB by the international depositary 
authority. A restriction map was constructed of the genomic clone and a 7 kb Xhoi 

10 fragment, containing 3.5 kb promoter and part of the RCH1 coding sequence, was 
subcloned into the expression vector pBS (Stratagene, La Joila, CA). The 3.5 kb" 
pronnoter region including the first amino acids of the putative RCH1 ORF is set forth in 
Figure 5 and is Identified as SEQ ID NO 2. This promoter region was amplified using the 
prinners M13F and PRCH1R and Pfu Taq polymerase (Stratagene, La Jolla, CA) In the 

15 manufacturer specified conditions. The promoter region was subsequently cloned into 
the binary pCAMBIA3383Xb vector (Roberts et al. 2000) creating a translatlonal fusion 
with the GUS/GFP reporter gene. 

GUS histochemlcal assay. 

20 Arabidopsis ttialiana plants were transformed with this construct according to the 
vacuum transformation method (Bechtold & Pelletier 1998). Transformants were 
selected for phosphinothricin-tripeptide (PTT, Duchefa, NL) resistance. 
Htstochemical localization of p -Glucuronidase activity (GUS) was performed using the 
substrate 5-bromo-4-chloro-3-indolyl glucuronide (X-gluc, Blosynth AG, Staad, 

25 Switzerland). X-gluc was dissolved in n-dimethyl-formamide and diluted to 0.5 mg/ml in 
50 mM NaP04, pH 7.2 with 0.1% Triton X-100. Oxidative dimerization of the produced 
indoxyl derivative was enhanced by adding the oxidation catalyst 
ferricyanide/ferrocyanide to a final concentration of 0.5 mM (0.5 mM K4Fe(CN)6.H2O/0.5 
mM K3Fe(CN)6). The GUS reaction was incubated at 37°C over several time periods, 

30 and pictures were taken from the cloured plant-tissues (Figures 7 and 15). In Figure 7 
the very specific expression pattern of the RCH1 promoter is visualized by the GUS 
reporter gene activity. Persons skilled in the art will also agree to the fact that the 
expression level of the RCH 1 promoter is very high, since after one hour the saturation 
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Example 5 : Root-specific reporter gene expression in Rice mediated by the 
Arabidopsis thaliana RCH 1 promoter. 

CLAVATA homologs are found in many other plant ispecies from both the groups of 
monocots and dicots. Therefore one can assume that also the promoter of RCH1 is 

5 functional in those other plant species, and possibly exerts the same specific expression- 
pattern. To investigate the feasibility of root meristem-specific activation of the 
Arabidopsis tiiaiiana RCH1 promoter in monocots, the aforementioned promoter-reporter 
gene construct, wherein the GUS/GFP chimeric gene is operabiy linked to the RCH1 
promoter, will be transformed to rice using the standard transformation procedures well 

10 known to the persons skilled in the art and outlined in the following paragraph. After 
several time periods ranging from 1 day to 1 or more weeks, the seedling will be 
checked for the expression of the GUS reporter gene. This will be done by growing the 
seedlings in organogenesis medium, and staining them with glucuronidase for several 
time periods ranging from 1 hour to 5 hours. 

15 

Agrobacterium-mediated rice transformation 

The Promoter-GUS hybrid gene outlined in Example 2 can be transformed to 
Agrobacterium tumefaclens strain LBA4404 or C58 by means of eiectroporation and 
subsequently transformed bacterial cells can be selected on a solid agar medium 

20 containing the appropriate antibiotics. 

For demonstration of root-specific GUS expression with the promoter of the current 
invention, 309 mature dry seeds of the rice japonlca cultlvars Nipponbare or Taipei are 
dehusked, sterilised and germinated on a medium containing 2,4-dichlorophenoxyacetic 
acid (2,4-D). After incubation in the dark for four weeks, embryogenic, scutellum-derived 

25 call! are excised and propagated on the same medium. Selected embryogenic callus is 
then co-cultivated with Agrobacterium. Co-cultivated callus is grown on 2,4-D-containing 
medium for 4 to 5 weeks in the dark in the presence of a suitable concentration of the 
appropriate selective agent. During this period, rapidly growing resistant callus islands 
develop. After transfer of this material to a medium with a reduced concentration of 2,4-D 

30 and incubation in the light, the embryogenic potential is released and shoots develop in 
the next four to five weeks. Shoots are excised from the callus and incubated for one 
week on an auxin-containing medium from which they can be transferred to the soil. 
Hardened shoots are grown under high humidity and short days in a phytotron. Seeds 
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lovet of staining is reached in the meristem of the main root from a 1 week old seedling 

{Figure 7 D). 

RCH1 ::GFP expression in roots is shown in Figure 16. 

5 Example 3 : Alignment. 

Soquence alignment of one complete register sequence against a target sequence as in 
hgure tO was done using the program GAP or the GOG package. The algorithm of 
MoGdieman and Wunsch is applied here to find the alignment of two complete 
sequences (Needteman and Wunsch, JMB 48(3): 443-453, 1970). The used parameters 
10 dunrtg the alignment were Gap Weight: 8, Average Match: 2.912, Length Weight: 2, 
Average Mismatch: -2.003, Quality: 1388, Length: 1149, Ratio: 1.416, and Gaps: 25. 

Example 4 : Phylogenetic tree. 

Fof aoquence comparison of proteins as in figure 11 and 12, the Ciustal V method to 

15 a^jn aU the sequences was used (Higgins DG & Sharp PM, Comput AppI BioscL 1989 
Apr.5(2): 151-3). The Ciustal Method groups sequences into clusters by examining 
sequence distances between all pairs. Clusters are aligned as pairs then collectively as 
sequence groups to produce the overall alignment. After the multiple alignment is 
completed, a neighbor-joining method is employed to reconstruct phylogeny for the 

20 putatwe alignment, 

Th« Ciustal method uses weight tables to construct multiple alignments. Residue weight 
tables are used in scoring protein and nucleotide alignments so that slightly mismatched 
resKkses, such as lie vs. Val amino acids or G vs. R nucleotides, are not scored the 
same as total mismatches. For the alignment and pylogenetic trees in figure 11 and 12 

25 ih^ PA^4250 weight table was used. Other method parameters for pairwise alignment 
wef© K-tuple: 1, Gap Penalty: 3, Window: 5 and Diagonals: 5. Other method parameters 
fof mufiiple alignment include Gap Penalty; 10 and Gap Length Penalty: 10. 
To view phylogenetic relationships compatible with multiple sequence alignments a 
Ptryiogenetic Tree was caiculated. The length of each pair of branches represents the 

.vt» distance between sequence pairs, while the units at the bottom of the tree indicate the 
number of substitution events. The phylogenetic tree gives a first estimate of the 
reiatonship between homologous sequences. A cladogram presentation of the 
phylogenetic tree is shown in figures 1 1 and 1 2. 
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can be harvested three to five months after transplanting. The method yields single locus 
transformants at a rate of over 50 % (Chan et ai. 1 993;H!et at al. 1 994) 

Example 6 : Expression of a cytokinine oxidase cDNA under the root-specific 
5 RCH1 promoter in transgenic Tobacco and Rice leads to increased root 

production. 

The AtCKXI gene (located in the database with the accession number: AC002510, 
Arabidosis thaiiana chromosome 11 section 225 of 255 of the complete sequence, 
sequence from done T32G6) can be cioned under the control of the RCH1 promoter of 

10 Arabidopsis, in a binary vector. Also the ZmCKXI from maize (accession number 
AF044603, (Morris et al. 1999) can be operably iini<ed to the RCH1 promoter a binary 
vector. These genes can be transfected into Tobacco or Rice plants, using 
agrobacterium-mediated gene transfer. The transgenic plants expressing the ATCKX1 or 
the ZmCKXI specifically in roots are expected to show increased root production without 

15 negatively affecting shoot development and without inducing premature leaf 
senescence. 

Example 7 . Minimal promoter sequence. 

The length of the RCHl promoter region (SEQ ID NO 2), which confers to root specific 
20 expression patterns, can be reduced to a "minimal promoter" without altering the root 
specific expression patterns or other features of the promoter such as constitutive and 
strong activity. As an example, the length of the promoter region was reduced to a 
minimal promoter sequence ranging from the -1 base pair just before the start codon at 
position 3479-3481 of SEQ ID NO. 2 to -842 base pairs before said start codon. Said 
25 minimal promoter is represented in SEQ ID NO. 18, This minimal promoter was PGR 
amplified and translationally fused to the GUS reporter gene. With a GUS histochemical 
assay (example 2) the same expression pattern as the large promoter region was 
demonstrated (results not shown). The difference is that the sequence before the Hindlli 
site at the beginning of the minimal promoter is not necessary for the root specific 
30 expression pattern. 
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Example 8 . RCH1 prom::GUS expression in embryo's. 

Siiiques of Arabidopsis plants transformed with RCH1prom::GUS were opened at 
ditiorent stages of development and the embryo's were surgically taken out of the ovuies 
arxi transferred to X-giuc staining soiution. Embryo's Vt/ere stained for 0.5h ■- 4h before 
5 p*c!uros v^ere taken, using light microscopy, Fig 15 A and B are at 25X and 40X 
rragndication respectiveiy, stained for 0.5 h. Longer staining did not reveal expression 
outsKJe the root promeristem region. 

BCH1 is active very early: RCH1 expression was first observed in torpedo stage 
err^fyo's in the root promeristem and was maintained in this region during the later 
10 stagos of development. This figure shows that as soon as a meristem is formed RCH1 is 
expressed and provides further evidence that RCH1 expression is con-elated with the 
appearance and presence of a root meristem from the embryonic stage onwards. 



I Example 9 . Isolation of cDNA of otiier Arabidopsis RCH1 homoiogues 
Table 4: % identity with RCH1 



1 



RCHO 



I BCH1. 5 



Database entry 
genomic DNA 



AB028621 + 
AP002037 



AC015446 



Database entry 
protein 



BABQ3091 



CAA1826 or 
T05050 



AAG12526 



% sequence 
identity with 
RCH1 on aa level 



59.8% 



42.1% 



To discover the exact function of RCH1, the inventors want to circumvent the 
rtxJuodancy of the gene. 

AB028621 (see figure 12, phylogenetic tree) is determined to be RCH1.2 protein and is 
also used by the inventors for further experiments. Double knocl< outs of RCH1 and 
RCH1.2 were made and also other closely related sequences (AB011476.pro, 
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T05050.pro and AC015446.pro) are used in knock out experiments or in antisense 
expression. 

Ectopic expression of RCH1 

The expression of RCH1 in normai ceils correlates with the zone of rapidly dividing cells 
5 and is involved in the maintenance of meristem celis and dividing ceiis. Improved activity 
of RCH1 in a host ceii results in the maintenance of cell division in that host ceii and thus 
works against the differentiation of that cell. On the contrary diminished RCH1 activity in 
a host cell results In a diminished cell division in that host cell and thus in the loss of the 
amount of dividing cells in the meristem. Particular examples to influence these 
10 processes by altering the expression level of the RCH1 gene are described in the 
following paragraphs: 

Overexpresslon 

In order to obtain higher activity of the RCH1 protein in the host ceil the coding region of 
15 the RCH1 gene is fused to a strong promoters that is inducible or constitutive and/or 
ubiquitous or tissue specific. 

To influence the RCH1 activity in a particular set of host ceils, the RCH1 coding region is 
cloned under control of other meristemic promoters such as the promoters of 
WUSCHEL, CLV1, CLV2, CLV3, STM. The obtained effect is the control of the 
20 expression of RCH1 in a particular area of the meristem or in a particular time period of 
the development of the plant. Also the promoters of other RCH genes (such as 
AB028621.pro. AB011476.pro, T05050.pro and AC015446.pro) are used for this 
purpose. 

In a particular application of the present invention, the RCH1 gene and the RKN gene (= 
25 the rice homologue of RCH1 as described in WO/04761) are operationally linked to the 
root-specific promoter PYK10, a constitutive ubiquitin or GOS 2 promoter and Cdc2a 
promoter and expressed in Arabidopsis or in rice. 
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Do wnregulation 

In order to obtain lower activity of the RCH1 protein in the host cell, the coding region of 
the RCH1 gene cloned in a sense and antisense orientation in order to form a hairpin 
that induces gene-silencing in the host cell. 

5 

Co-expression 

Downregulatlon or upregulation of the RCH1 in combination with downregulation or 

upregulation of other genes e.g.: 

WithWUSHEL: 

10 In the shoot meristem a balance between the WUSCHEL and CLAVATA (CLV) genes 
has been implicated to play an essential role in regulation ceii division and differentiation 
(Clari< et ai. 1993;Fletcher et al. 1999;Schoof el ai. 2000). The whole meristem 
maintenance is controiled by these genes and they seem to have an opposite function 
i<eep!ng the balance beti,veen eel! division and differentiation. Therefore, WUSCHEL or a 

15 root WUSCHEL homoiogue is a preferred partner to be co-expressed with RCH1 . 

In an application of the invention RGH1 is upregufated in combination with the 
downregulation of WUSCHEL in a root specific manner. The obtained effect of this co- 
expression is a better control on the meristem maintenance. 
With other RCH1 family members 

20 Possibly the fur^ction of RCH1 is redundant and other closely related genes can take 
over its function. Therefore the RCH1 gene ectopically expressed in combination with 
closely related genes in order to downregulate or upregulate the activity of the encoded 
protein. The RCH1 homologues used for co-expression are for example AB028621 .pro, 
AB011476.pro, T05050.pro and AC015446 (see figure 12 and table 4). The obtained 

25 effect is a better effect as described for the ectopic expression of RCH1 alone. 
With other Clavata homologues: 

Clavata 1 is a receptor that is involved in a signal pathway probably through the binding 
of a ligand (e.g. Clavata 3). A particular useful application of the present invention is the 
co-expression of RGH1 with the root Clavata 3 homoiogue RCH3, The obtained effect is 
30 a better control on meristem maintenance. 
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Example 1 0 : Domain swapping 

The N-terminal LRR domain of RCH1 functions as an extracellular perception domain 
and a C-terminal kinase domain functions in an intracellular signal transduction pathway. 
Previously it was described by He et al. (Science 2000, 288(5475):2380-3: Perception of 

5 brassinosteroids by the extracelluiar domain of the receptor kinase BRI1) that such 
domains can be swapped with the domains of other proteins. Following this methodology 
in a particular application of the present invention, the domains of RCH1 are swapped 
with the domains of another protein. In one example the LRR domain of RCH1 is 
replaced by the LRR domain of BRI. The effect obtained is tiie activation of a different 

10 pathway, which is normally not activated by RCH1 , because tiie kinase domain of RCH1 
now responds to another external signal that normally is not recognized by RCH1 . 

It will be clear that the invention may be practiced othenwise than as particularly 
described in the forgoing description and examples. Numerous modifications and 
L5 variations of the present invention are possible in light of the above teaching and, 
therefore, are within the scope of the appended claims. 
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CLAIMS 

1 . An isolated nucleic acid conprising a transcriptional regulatory root promoter or a 
functlonai fragment of such a promoter and/or comprising a nucleic acid sequence 
encoding a novel LRR receptor-like kinase (protein) or an immunologically active 
5 and/or functional fragment of such a protein (kinase) seiected from the group 
consisting of: 

(a) a nucleic acid comprising the DNA sequence as given in any of SEQ ID NOs 1 , 
2, 3 or 1 8, or the complement thereof, 

(b) a nucleic acid comprising the RNA sequences corresponding to any of SEQ ID 
10 NOs 1, 2, 3 or 18, or the complement thereof, 

(c) a nucleic acid speoifioally hybridizing with the nucleotide sequence as defined in 

(a) or (b), 

(d) a nucleic acid encoding a protein comprising the amino acid sequence as given 
in SEQ ID NO 4, SEQ ID NO 5 or SEQ ID NO 19, 

15 (e) a nucleic acid which is degenerated, as a result of the genetic code, to a 
nucleotide sequence as defined in any of (a) to (d), and which sequence codes 
for a protein having the amino acid sequence as given in SEQ ID NO 4, SEQ ID 
NO 19, 

(f) a nucleic acid which is diverging, due to the differences in codon usage between 
20 the organisms, to a nucleotide sequence as defined in any of (a) to (d), and 

which sequence codes for a protein having the amino acid sequence as given in 
SEQ ID NO 4, SEQ ID NO 19, 

(g) a nucleic acid which is diverging, due to the differences between alleles, to a 
nucleotide sequence as defined in any of (a) to (d), and which sequence codes 

25 for a protein having the amino acid sequence as given in SEQ ID NO 4, SEQ ID 

NO 19. 

(h) a nucleic acid encoding a fragment of a protein encoded by a DNA sequence as 
given in SEQ ID NO 1 or SEQ ID NO 3 or encoding a fragment of a protein 
encoded by a nucleic acid as defined in any one of (a) to (g), wherein said 

30 fragment comprises the sequence as represented in SEQ ID NO 5, and, 



PCT/'EP01/141S4 



(k) a nucleic acid encoding a protein as defined in SEQ ID NO 4, SEQ ID NO 19 or a 
nucleic acid as defined in any one of (a) to (h), said nucleic acid interrupted by 
intervening DNA, 

provided tliat said nucleic acid is not one of the nucleic acids as deposited under the 
5 GenBank Accession numbers AB01 7061 , AV538507, AV520681 or AQ96641 9. 

2. An isolated nucleic acid comprising a transcriptional regulatory root promoter 
comprising the sequence as given in any of SEQ ID NOs 1 , 2 or 1 8. 

3. An isolated nucleic acid comprising a transcriptional regulatory root promoter 
consisting of the sequence as given in SEQ iD NO 2 or 18, or a functional part of 

10 said sequence wiiich is able io regulate gene expression in a root specific manner, 
provided that said nucleic acid is not one of the nucleic acids as deposited under the 
GenBank Accession numbers AV538507 or AV520681 . 

4. An isolated nucleic acid comprising a transcriptional regulatory root specific promoter 
comprising: 

15 (a) at least a functional part of the DNA sequence as given in any of SEQ ID NO 1 , 
2 or 18, and, 

(b) a second transcriptional regulatory sequence, optionally a regulatory promoter 
sequence not normally exhibiting root-specificity. 

5. An isolated nucleic acid according to claim 4 wherein said second transcriptional 
20 regulatory promoter sequence is the ubiquitin pronrrater. 

6. An isolated nucleic acid selected from the group consisting of: 

(a) a nucleic acid consisting of at least part of the DNA sequence as given in SEQ 
ID NO 1 or 3, or the complement thereof, 

(b) a nucleic acid encoding a protein as given in SEQ ID NO 4 or 19 or encoding a 
25 fragment of said protein, wherein said fragment comprises the sequence as 

represented in SEQ ID NO 5, and 

(c) a hucieic acid encoding a protein with an amino acid sequence which is at least 
65 % identical to the protein as given in SEQ ID NO 4 or 9, wherein said amino 
acid sequence comprises the sequence as represented in SEQ ID NO 5, 
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or a homologue or a derivative of said protein, or an immunologically active and/or 
functional fragment thereof. 

16. A polypeptide as defined in claim 15 being a root CLAVATA homologue or a 

functional homologue thereof 

5 17. An isolated polypeptide comprising the sequence represented in SEQ iD NO 5 
encodabie by a nucleic acid of any of claims 1 to 7, or a homologue or a derivative 
thereof, or an immunologically active and/or functional fragment thereof. 

1 8. A method for producing a polypeptide according to any of claims 1 5 to 17 comprising 

culturing a host eel! of claim 13 or 14 under conditions allowing the expression of the 
10 polypeptide and recovering the produced polypeptide from the culture. 

19. An antibody specifically recognizing an LRR receptor-like kinase (protein) of any of 
claims 15 to 17 or recognizing immunologically active parts or specific epitopes 
thereof. 

20. A method for the production of transgenic plants, plant cells or plant tissues 
15 comprising the introduction of a nucleic acid molecule according to any of claims 1 to 

7 in an expressible format or a vector according to any of claims 10 to 12 in said 
plant, plant ceil or plant tissue. 

21 . The method of claim 20 further comprising regenerating a plant from said plant cell. 

22. A transgenic plant cell comprising a nucleic acid sequence of any of claims 1 to 7 
20 which is operably linked to regulatory elements allowing transcription and/or 

expression of said nucleic acid In plant cells or obtainable by a method of claim 20 or 

21. 

23. The transgenic plant cell of claim 22 wherein said nucleic acid of any of claims 1 to 7 
is stably integrated into the genome of said plant ceil, 

25 24. A transgenic plant or plant tissue comprising plant cells of claim 22 or 23. 

25. A harvestable part of a plant of claim 24. 

26. The harvestable part of claim 25 which is selected from the group consisting of 
seeds, leaves, fruits, stem cultures, rhizomes, tubers and bulbs. 

27. The progeny derived from any of the plants or plant parts of any of claims 24 to 26. 
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characterised in that said nucleic acid encodes a novel LRR receptor-like kinase 
protein or an immunologicaHy active and/or functional fragment of such a protein, 
and further provided that said nucleic acid is not one of the nucieic acids as 
deposited under the GenBank Accession numbers AB017061 or AQ966419. 

5 7. An isolated nucleic acid according to any of claims 1 to 6 which is DNA, cDNA, 
genomic DNA, synthetic DNA, or RNA wherein T is replaced by U. 

8. A nucleic acid rnolecuie of at least 15 nucleotides in length hybridizing specifically 
with a nucieic acid of any of claims 2 to 7. 

9. A nucleic acid molecule of at least 15 nucleotides in length specifically amplifying a 
10 nucleic acid of any of claims 2 to 7. 

1 0. A vector comprising a nucleic acid sequence according to any of claims 1 to 7. 

11. A vector according to claim 10 which is an expression vector wherein said nucleic 
acid sequence encoding a novel LRR receptor-like kinase or an immunologically 
active and/or functional fragment thereof, is operably linked to one or more control 

15 sequences allowing the expression in prokaryotic and/or eukaryotic host cells, 

12. A vector according to claim 10 wherein said vector comprises at least part of a 
nucleic acid according to any of claims 2 to 7 and wherein said transcriptional 
regulatory root promoter or functional parts thereof is operably linked to one or more 

genes of interest. 

20 13. A host cell containing a nucleic acid molecule according to any of claim 1 to 7 or a 
vector according to any of claims 10 to 12. 

14. The host cell according to claim 13, wherein said host cell Is a bacterial, insect, 
fungal, plant or animal cell. 

15. An isolated LRR receptor-like kinase comprising one of the polypeptides selected 

25 from the group consisting of: 

(a) a polypeptide as given in SEQ ID NO 4 or SEQ ID NO 19, 

(b) a polypeptide comprising the amino acid sequence as given in SEQ ID NO 5, 

and, 

(c) a polypeptide encoded by a nucleic acid as given in SEQ ID NO 1 or 3, 
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28. A method for conferring root-specificity, root-meristenn-speciflcity, root-vascuiar- 
tissue-specificity, root-endodermis-specificity, root-cortex-specificity, root-epidermis- 
specificity, root-quiescent-center-specificlty, and/or abundant expression in root 
and/or early expression In root to otiier promoter sequences comprising tiie fusion at 

5 least a functional part of tlie DNA sequence as given in SEQ ID NO 1, 2 or 18 to a 
second transcriptional regulatory promoter sequence normally not exhibiting root- 
specificity. 

29. A method for root-specific expression of a gene(s) of interest comprising operably 
linking of said gene(s) of interest to a transcriptional regulatory root-specific promoter 

10 as defined in any of claims 2 to 5. 

30. A method for modifying ceil fate and/or plant development and/or plant morphology 
and/or plant biochemistry and/or plant physiology comprising the modification of 
expression in the meristem of the main and the lateral roots, in the vascular tissue of 
the root or in the lateral root primordia of a gene(s) of interest operably linked to a 

15 transcriptional regulatory root-specific promoter as defined in any of claims 2 to 5. 

31 . A method for phytoremediation comprising the expression of a gene(s) of interest 
under the control of a transcriptional regulatory root-specific promoter as defined in 
any of claims 2 to 5. 

32. A method for environmental remediation comprising the expression of a gene(s) of 
20 interest under the control of a transcriptional regulatory root-specific promoter as 

defined in any of claims 2 to 5. 

33. A method for enhancing plant growth and crop yield comprising the expression of a 
gene(s) of interest under the control of a a transcriptional regulatory root-specific 
promoter as defined in any of claims 2 to 5. 

2.5 34. A method for conferring enhanced resistance to pathogens which attack the 
belowground plant tissue comprising the expression of a gene(s) of interest under 
the control of a transcriptional regulatory root-specific promoter as defined in any of 

claims 2 to 5. 

35. A method for enhancing freezing tolerance in plants comprising the expression of a 
30 gene(s) of interest under the control of a transcriptional regulatory root-specific 
promoter as defined in any of claims 2 to 5. 
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36. A method for enhancing resistance of a plant to drought and/or high salt comprising 
the expression of a gene(s) of interest under the control of a transcriptional 

regulatory root-specific promoter as defined in any of claims 2 to 5. 

37. A method for the production of light-sensitive proteins comprising the expression of a 
5 gene encoding said light-sensitive protein under the control of a transcriptional 

regulatory root-specific promoter as defined in any of claims 2 to 5. 

38. A method for root specific gene silencing comprising the expression of an antisense 
nucleotide sequence to the gene of interest, a gene silencing construct or a ribozyme 
under the control of a transcriptional regulatory root-specific promoter as defined in 

10 any of claims 2 to 5. 

39. A method for stimulating root meristem formation comprising the expression of a 
gene that influences the proliferation and/or differentiation of the root meristem under 
the control of a transcriptional regulatory root-specific promoter as defined in any of 
claims 2 to 5. 

15 40. A method for stimulating root meristem formation comprising the expression in 

particular ceils, tissues or organs of a plant, of a nucleic acid encoding an LRR 
receptor-like kinase operabiy linked to a plant-operable promoter sequence. 

41 . A method stimulating root meristem formation comprising the expression in particular 
cells, tissues or organs of a plant, of a nucleic acid encoding an LRR receptor-like 

20 kinase as defined in any of claims 15 to 17 operabiy linked to a plant-operable 
promoter sequence. 

42. A method stimulating root meristem formation comprising the expression in particular 

ceils, tissues or organs of a plant, of a nucleic acid encoding an LRR receptor-like 
kinase under the control of a transcriptional regulatory root promoter as defined in 
25 any of claims 2 to 5. 

43. A method for stimulating root meristem formation comprising the expression in 
particular cells, tissues or organs of a plant, of a nucleic acid encoding a polypeptide 
as defined in any of claims 15 to 17 under the control of a transcriptional regulatory 
root promoter as defined in any of claims 2 to 5. 
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4.; A method for root-meristem maintenance comprising the expression of a gene that 
influences the proliferation and/or differentiation of the root meristem under the 
control of a transcriptional regulatory root-specific promoter as defined in any of 

claims 2 to 5. 

5 45 A method for root-meristem maintenance comprising the expression in particular 
ceils, tissues or organs of a plant, of a nucleic acid encoding an LRR receptor-iil^e 
kinase operabiy linked to a piant-operable promoter sequence. 

46 A method for root-meristem maintenance comprising the expression in particular 
colls, tissues or organs of a plant, of a nucleic acid encoding an LRR receptor-li!<e 
10 ksnase as defined in any of claims 15 to 17 operabiy linked to a plant-operabie 

promoter. 

A 7 A method for root-meristem maintenance comprising the expression in particular 
c«MiS, tissues or organs of a plant, of a nucleic acid encoding an LRR receptor-like 
kinase under the control of transcriptional regulatory root promoter as defined in any 
i> of daims 2 to 5. 

48 A method for root-meristem maintenance comprising the expression in particular 
cells, tissues or organs of a plant, of a nucleic acid encoding a polypeptide as 
defined in any of claims 15 to 17 under the control of a transcriptional regulatory root 
promoter as defined in any of claims 2 to 5. 

:tj 49 A method for enhancing root formation and/or root growth comprising the expression 
m particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 
roceptor-like kinase operabiy linked to a plant-operable promoter sequence. 

50 A method for enhancing root formation and/or root growth comprising the expression 
>n particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 

roceptor-iike kinase as defined in any of claims 15 to 17 under the control of a plant 

operable promoter sequence. 

5 1 A method for enhancing root formation and/or root growth comprising the expression 
in particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 
receptor-like kinase under the control of a transcriptional regulatory root promoter as 

30 defined in any of claims 2 to 5. 
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52. A method for enhancing root-formation and/or root growtii comprising Vne expression 
in particular cells, tissues or organs of a plant, of a nucleic acid encoding a 

polypeptide as defined in any of claims 15 to 17 under the control of a transcriptional 

regulatory root promoter as defined in any of claims 2 to 5. 

5 53. A method for enhancing overall growth and yield comprising the expression in 
particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 
receptor-like kinase operably linked to a plant-operable promoter sequence. 

54. A method for enhancing oyerall growth and yield comprising the expression in 
particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 

10 receptor-like kinase as defined in any of claims 15 to 17 operably linked to a plant 
operable promoter sequence. 

55. A method for enhancing overall growth and yield comprising the expression in 
particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 
receptor-like kinase under the control of a transcriptional regulatory root promoter as 

15 defined in any of claims 2 to 5. 

56. A method for enhancing overall growth and yield comprising the expression in 
particular cells, tissues or organs of a plant, of a nucleic acid encoding a polypeptide 
as defined in any of claims 15 to 17 under the control of a transcriptional regulatory 
root promoter as defined in any of claims 2 to 5. 

20 57. A method for modifying cell fate and/or plant development and/or plant morphology 
and/or plant biochemistry and/or plant physiology comprising the modification of 
expression In particular cells, tissues or organs of a plant, of a nucleic acid encoding 

an LRR receptor-like kinase operably linked to a plant-operable promoter sequence. 

58. A method for modifying cell fate and/or plant development and/or plant morphology 
25 and/or plant biochemistry and/or plant physiology comprising the modification of 

expression in particular cells, tissues or organs of a plant, of a nucleic acid encoding 
an LRR receptor-like kinase as defined in any of claims 15 to 17 operably linked to a 
plant-operable promoter sequence. 

59. A method for modifying cell fate and/or plant development and/or plant morphology 
30 and/or plant biochemistry and/or plant physiology comprising the modification of 

expression in particular cells, tissues or organs of a plant, of a nucleic acid encoding 
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an LRR receptor-like kinase under the control of a transcriptionai reguiatory root 

promoter as defined in any of claims 2 to 5. 

60. A method for modifying cell fate and/or plant development and/or plant morphology 
and/or piant biochemistry and/or plant physiology comprising the modification of 

5 expression in particuiar cells, tissues or organs of a plant, of a nucieic acid encoding 
a polypeptide as defined in any of claims 15 to 17 under the control of a 
transcriptional regulatory root promoter as defined in any of claims 2 to 5. 

61 . A method to confer pathogen resistance to a plant comprising the expression in 
particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 

10 receptor-like kinase operably linked to a plant-operable promoter sequence. 

62. A method to confer pathogen resistance to a piant comprising the expression in 
particular cells, tissues or organs of a piant, of a nucleic acid encoding an LRR 
receptor-like kinase as defined in any of claims 15 to 17 operably linked to a plant- 
operable promoter sequence. 

15 63. A method to confer pathogen resistance to a plant comprising the expression in 
particular cells, tissues or organs of a plant, of a nucleic acid encoding an LRR 
receptor-like kinase under the control of a transcriptional regulatory root promoter as 
defined in any of claims 2 to 5. 

64. A method to confer pathogen resistance to a plant comprising the expression in 
20 particular cells, tissues or organs of a plant, of a nucleic acid encoding a polypeptide 

as defined in any of claims 15 to 17 under the control of a transcriptional regulatory 
root promoter as defined in any of claims 2 to 5. 

65. A method for identifying and obtaining proteins interacting with a (LRR receptor-like 
kinase) polypeptide of any of claims 15 to 17 comprising a screening assay wherein 

25 a polypeptide of any of claims 1 5 to 17 is used or expressed. 

66. A method according to claim 65 comprising a two-hybrid screening system wherein a 
nucleic acid encoding a polypeptide of any of claims 15 to 17 as a bait and a cDNA 
library as prey are expressed. 
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67. A method for modulating the interaction between an LRR receptor-like kinase 
(protein) as defined in any of claims 15 to' 17 and interacting proteins obtainable by a 

method according to claim 65 or 66. 

68. A method for identifying and obtaining compounds interacting with an LRR receptor- 
like kinase (protein) comprising the steps of: 

(a) providing a two-hybrid screening system wherein a polypeptide of any of claims 
15 to 17 and a protein interacting with said LRR receptor-like kinase (protein) or 
an interacting protein obtainable by a method of claim 65 or 66 are expressed, 

(b) interacting said compound with the complex formed by the expressed proteins as 
defined in a), 

(c) detecting a second complex, wherein the presence of said second complex 
identifies a compound which specifically binds to one of said polypeptides or said 
second complex, and 

(d) identifying the compound. 

69. A method for identifying compounds or mixtures of compounds which specifically 
bind to a polypeptide of any of claims 15 to 17, comprising: 

(a) combining a polypeptide of any of claims 15 to 17 with said compound or 
mixtures of compounds under conditions suitable to allow complex formation, 
and, 

(b) detecting complex fonmation, wherein the presence of a complex identifies a 
molecule which specifically binds said polypeptide. 

70. Use of a molecule identifiable by means of a method of claim 68 or 69 as a plant 
growth regulator or herbicide. 

71 . A method for the production of a plant growth regulator or herbicide composition 
comprising the steps of the method of claim 68 or 69 and formulating the compounds 
obtained from said steps in a suitable form for the application in agriculture or plant 

cell or tissue culture 

72. Use of a nucleic acid molecule of any of claims 1 to 7, a vector of claim 1 0 to 12, a 
polypeptide of claim 15 to 17 or an antibody of claim 19 for modifying cell fate and/or 
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plant development and/or plant morphology and/or plant biochemistry and/or plant 

physiology. 

73. Diagnostic composition comprising at least one nucSeic acid molecuie of any of 
claims 1 to 7, vector of claim 10 to 12, polypeptide of claim 15 to 17 or antibody of 

5 claim 19. 
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Figure 1 
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SEQlDNo. Primer Seque n ce (5'-3') Detaiis 



SEQ ID No. 6 


LRR1 


AAYAAYTTM.ASYGGTKMDWTHCC 


Degenerate LRR primer 


SEQ !D No. 7 


LRR2 


CARYI'HAMCGGW G AAATHCC 


Degenerate LRR prinier 


SEQ iD No. 8 


LRR3 


MAYYTSAiYGGAHGTATHCC 


Degenerate LRR primer 


SEQ ID No. 9 


KiN1 


THGCWTTWGSCATGTA^JTCATA 


Degenerate kinase primer 


SEQ ID No. 10 


K!N2 


AYTCDGGDGCHAHGTADCC 


Degenerate kinase primer 


SEQ ID No. 11 


KINS 


MAYHGCRAARCTRTAVACATC 


Degenerate kinase primer 


SEQ ID No. 12 


RCH1F 


CGATCAGACACMGAACAT 


RCH1 specific primer 


SEQ !D No. 13 


RCH1R 


AGG.A.ATGGTGTTGGAAQAA 


RCH1 speciiic primer 


SEQ ID No. 14 


UBIF 


TGCAGATCTTYGTGAAGAG 


Gompl. to 5' ersd of repeating ubiquitin unit 


SEQ ID No. 15 


UBIR 


GACTCCTTOTGGATGTTG 


Compl. to 3' end of repeating sjbiqiJitir. unit 


SEQ ID No. 16 


M13F 


TGTAAAAGGACGGGCAGT 


Compl. to pBS sequence 


SEQ ID No. 17 


PRCH1R 


CAGACAGGATCCAATCGGCATTT 


Conripi. to RCH1 promoter, includes small 






GCAAAGACATA 


coding sequence and BamHi site 



N=A+C+G+T; Y=C+T; M=A+G; S=G+G; K=T+G; W=A+T; R= A+G; D=A+T+G; H=A+T+C; V=A+C+G; 
l=deoxylnosine 
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Figure 3 



C R C R C R 
IVI 23 23 22 22 21 21 




Figure 4 



RCHl 



wo 02/46439 



4/40 



PCT/EPOl/14154 



Figure 5-1 SEQ ID NO 1 : 7438 bp 

CTCGAGTTTCAG&TGTTTCTATTAAATAAGTJUiTGAAAGCAGCTCT'AATTCTCCCAAAATTCACAAGTATAGii^ 

AAAGATAGAAACAGAGTTATTGCATGGAAGCTGAAAATCATACAGCATGGTGGATATAACCAATGAGACACAAAACAATC 

ACGGCATGTGTTATAGAACAACACTTATCATCTOTATTAACRTAAAGCAAAAGTTCCTATACGACATAGTCCAAAGTTAC 

AGACCATAAGCACACCTTATGATTTGTTGTACCCTCTAAAACACAGGATGATGACAAGAAGCAAATTTGCCC^ 

TGAGGGAAAAACGAGGGCTTAACGCTCTCATATCTCAAACAAAAACTTCTCAAACAGTTTCATGTGTTCAGATTC^ 

CAATGGCAACGGATAAGCTGCaVTCATTAGTAGGACTTGGTAGCACAAAAGAGAAACCCTCGTATGGAATTCCACCAGGT 

CCCATAAAGATAGGACGACCCCAACCAAAGTCTGCATCATAAATAGGTAATCTAACCCAGCTTGTGATTCCCAAATTTGG 

GCACTTGTAGGTATGTGCACCGCGGACAAGGGCTGACAGATCAGGCTGCATCTCCAGGTAGTCAAGAGCTGACCTCAGAT 

AGTTATCATCCATACGAACCAAGAAATCATGAATCTGTCCTGCAGCATACCATGTTGGCTTAGATAACAAATCCCCTGCA 

ACAGCCAATGGTGTTGCAGTGAATATCACAO^CCCAAAGTAACCAGGAGGCAGCTGCGGACGTAGTCTAGACCTTCCATC 

AGTTGCAATGTACAGTTTCGTCTCTTGGTCGTTTGGAAGCCCTCGCGCCTTTCCCACTGATCTCCACACATGCCCTGC^^ 

ACATCTCGTATGAGCTGTAGCTGACAGTGTTCCCATCCTCCTTGGATTTCGCCTTAAGAGCAACAAGCTGGTCTCGTGTT 

AATTTGAATATAGAGACAGTGGTATTCTCAGGTCCTGATTTAGACGGATCAAGAGGTATCTTCATACTTGGTGCAGGCTG 

ATATTCAACATGATGAAAAGCAGGCTGTGGCGGGTCCCTAGCTCGGAGGAGTGTTCGATCAATGAAAGGTGGAATG6TTA 

GGTCAAGACCACGAGCCATATCAGACCATGTGTTGATAAAATGAAGACCAGAGAAACCATCTGCCGCGTGATGTTGCATC 

CCAACCCCAAGTGAAGCTCCCCCACATTTABAGAAAGTCACCTA'PATCATCAAACAATATCATTACATAACTAAAAACAG 

AACATACATCATCAi^GAATATCATTACAG'AACTAASAAACAGAACAACACAAGCATTCAAT^ 

ACTTTAAATTAAGCTGACCT-GCAAAACGAGACiGCGGGAAAGAGTGAATGCCAGCGGAGTGATCAACTTCGGGAATAAGCT 

GACGGAGAa^'AAGGGTAGGAGCAMATCACCA^TCATCGATAACAGAAGGAGTATCAGCCACAACGAAGAGAACACCA 

GCACCGTTACAATCGATCTCAATACGACCATCATCGTCTCTCTTCAAGCGACCAGCCATAGGGTAAA&AGGGACAAGGGC 

TTOGGAAAGAGCT-rCCrrCATGACCTGAGGGTCAAAGAAATTGGAAGCGCCGGTGGGTCTGTAGAAGl^AGACACT^ 

TATGGAATCTGGGGATGACAAGGTCGACGT-TGGAGTTCCA&AGATTAGTGATTGGTGTCTCGGTGGCAGGCCGGACCATG 

GTGGAATCTCTGATGTTAATTTTCATA?TT?TAAGAAS.GATCAAACGAfiCAAACAAATTTTGTTTTGAGTTTGATCTGAG 

AAAIT'GTGTGAGAAGGGAGGAGGGGTAGATATAAGTGGGTGTSAACAAAAAAGGAAAGGTCGTAATCCTTCACCTACCGC 

CGCACGACCCCGCCTACTCCATCTGTCGACOUSlCTCTCTCAGAAGGTTTATCATTGGTCCATCTCATCACTCGGTCCTAC 

TTrTCTATCCGGGGTTGGTAAACCCGTTCCCTCTGTCCTAAGGCCCAAATAATACTAA'TGGGCTTAAATAGTA'TAAAGTT 

CGAATGAAGAAGAAGGAAAAA6AATTGACA?.GTTGATTGTTATAGCTACCaTACAAGrrAGTCAATCGAAlK5CC 

TGGCTTA>^CTTAACTCATTACCACGT&GCAGAGCATAACAACAACCACCAATAACATCTAACGGCGTfiAACTCACGTGTT 

AAGAGAATTACGTCAGAATCCAAGAAAGTCTT'nrTI-TCAGTACACCACWATAAATTATGCKGAATTm^ 

TACTAO:'ATTATATGAAAACATACCTCAATTAGTCATAAATTTCACA'TTTCaTTTTA?\TTTATTAACTAATGGTT 

GCGGATAGCTTTTCa'TTGCTTTCTGTTCTA^yXAa-TATI^TCrrAGCCATT^VTTTGATGATCCAACTTGGTTTGAGGA? 

ATAAAAATTCAATTTCTTTGAAGCATTCA-TTGGCTTAATGATCCAAACCTTTTGTCCTCCCACTCTCTAAQAA/lATAAAG 

CAAAJvGCAAAACT/UUU^GAAAAGCA^iCTAGl^ACCGl'GGGrrGGTC/iATGGT/iACCCAl^^ 

ttaaatgacaatttagtaccttgggttggtcatgatt-tagagcgga/icaaatataccatacatcaaacgaggatatacag 

agaaaattcatggaagtatggaat-ttagaggac/iatltctcttctgggctacaacggaccggcccattcgctca'tttacc 

cagaggtatcgagtttgtggacttttgatgccgctagagactattggcatcggattga^iaaaaatgtttacttcgttgtt 

aacaattttctgaatgcaatattttccttgtcatgaatatttaaaottgttattactttcttttagcrraggtgtggaca 

attatggagtttacttcaaacgaggaagaatctcaaacgctcggttcaggtctcg/iaaacaaaccaactcacaatcctga 

cttaattgaggaaaacaatggaaaaccacatgcatgcttccatatttctatcataatcttat/vagaaaamcactacta'v 

gtgaaatgattctgtatatatataaccaatgccta'ttgtu'ttgtgatatt-ttatgtatatataactattoacrl'ttc^^ 

tctatggatagtgtctcqggctcttggcaaacatatttcaaagaaaagttaatgactgtaattaattaatctga/igctag 

aaacagaaccccgaggtaaaagaaaaagacagagcacatgaagtrtagtactottatatatttaatatatcattctttct 

tattgcttatctctaaagcaaaaacttccctaaaccctaagccaa„aggactcagatcgatgcagaaccaagaaggcttg? 

tttggatttgagagccaaatgcaaaaaaaaaaactcttatgtctttgcaaatgccgattccmgaaaaaaagccttaac't 

gtttctcatttttccattacattgtgtgtgtttttagctttcttcatctcctcgacctctgcatcaaccaatgwigtctc 

agctttgatttcttggcttcacagctcgaactcgccgccaccgtcagtgttctccggctggaatccttctgattctgacc 

catgtcmtggccttacattacttgttcctcctcagacaacaaactcgttacagagatcaatgtcgtctc'tgt'rcagtta 

gctttacctttccctcctaacatttcttcgtttacttcacttcagaiiactcgttatctccaacactaatctcaccggagc 

tatatcttctgagatcggagattgttcggagcttatagttatcgatttaagctcaaatagtct-tgttggtga.aatacctr 

cgagtctagggaagctcaagai^itcttcaagaactttgtttaaactccaatggcctcacaggaaagatcccaccggaactc 

ggcgactgcgrragcctcaagaatct-tgagattttcgataagtacttatccgagaatctcccgttggagctcgga/iagat 

ctcgactcttgagagcataagagccggaggaaactcagagctttcagggaagatcccggaggagatcggaaactgtagga 

atctcaaggtct-tagggt-tagcagcaacgaaaatatccggttcttl'acctgtttcgtrgggtcaactaagcaa.gctccag 

tctttatctgtctattctacaatgctctcaggtgagatccctaaagagcttggaaactgctctgaacttatcaatctgtt 

tctatacgacaatgacttgtccggcacgct'rccaaaagaa.ctaggaaagcttcaaaacctagagaagatgcttc'ratggc 

agaataatctccacggacctatccctgaggagattggattcatgaaaagcttaaacgccattgatctctctatgaactat 

ttctcaggaaccatccctaaatcgttcggtaactogtcgaatctccaagagcttatgcttagcagcaacaacatcaccgg 

gtcaataccttcgattctaagtaactgcacaaagctt'gotcagttccagattgacgccaatcagatttcaggtttgattc 

caccagagattggattgcttaaggagctcaacatcttcttgggatggcagaataagctagaagggaatattccggacgaq 

ttagctggttgtcagaatcttcaagctcttgacttgtcacagaao'tatcteaccggatcattacctgcaggrttgtttca 

gcttcgtaatctgactaagctcttgcttatatctaatgcca/rctccggtgttatcccac'rggagatcgggaactgcactt' 

cgcttgtcagattaaggcttgtcaataacagaatcaccggagajlatccctaaaggtattggatttcttcagaacct'tagc 

ttcercgact-tgtctgagaacaatctctccggtccagttcct-ttggagataagtatittgtagacaactccagatgcttaa 

cttgagcaacaacactcttcaaqgttatci^cctctgtcit:tatcatcgttaacaaagcttcaagtacttcatgtctc^^ 

coiatgacttgactggcaagatacctgatagtctaggtcatctca'ittcactcaaccggctcat-tctaagtaagaa'i'tct 

•ttcaatggagaaatcccttcgtctctcggtcactgcacgajvtcttcagcttcttgatctcagcagtaataacatctctgg 

aactataccggaggagctttttgacatccaagatctagacattgcgttaaacttgagctggaattcatta6atggcttta 

tcccggaaaggatttccgcgctta'i.ccgatratccgtgctcaatatttcgcataacatgctttw^^ 

ctgtccggtctagaaaacttggtttctctg/lacatctctcataacagattctcaggttatcttccagatagtaaagtgtt 

tagacagctgataggagcagagatggaaggaaacaatggactctgttccaaaggtttcaggtcttgctttgtaagtaaca 

gt-tcacagttaactacacagcgtggtgtgcactcacatagactcaggatagccattggattgctaatcagcgtgacagcg 
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GTTCTAGCGGTATTAGGCGTGTTGGCGG1'TA"ACGAGCTAAACAJJ^TGATTCGAGATGATAACGATTCAGAGAOT 

AAATCTATGGACATGGCAATTCACACCTTTTCAGAAACTCAACTTCACAGTCGAACATGTACTCAAGTGTTTGGTAGAAG 

GTAATGTTATAGGGAAAGGTTGCTCCGGGATAGTGTACAAAGCTGAAATGCCTAACAGAGAAGTCATCGCGGTGAAAAAG 



Figure 5-2 



CTCTGGCCAGTGACAGTACCTAATCTGAATGAGAAAACTAAGTCATCAGGAGTTCGAGACTCATTCTCAGCTGAAGTAAA 

AACACTTGGATCGATCAGACACAAGAACATTGTACGCTTCTTGGGATGrrGTTGGAACAAGAACACTAGACTTCTTATQT 

ATGATTATATGTCAAATGGGAGTTTGGGAAGTTTGCTTCACGAAAGGAGCGGTGTATGTAGCTTAGGATGGGAAGTGAGG 

TACAAGATTATACTTGGTGCAGCTCAGGGT-TTGGCTTACTTSCACCATGACTGTGTTCCTCCCATTGTTCATAGAGACAT 

CAAGGCAAATAATATTCTGATTGGCCCTGATTT-TGAACCTTACATTGGAGATTTCGGACTCGCTAAGCTTGTTGATGATG 

GCGACTTTGCTCGTTCTTCCAACAGCATTGCTGGTTCCTATGG'IT'ACATAGCTCCAGaTAC'I'TCCCAAAT'rTCTGTTTTG 

CTGCTAATTAGTTAJ^.CTTGATACATCATAAGTTACTAGTAGTC'IT'AGCCTAGTAATGGTAGAAATT'CAGGT'rTATAG'TTT 

GGTTTTGGTTTTTGGCAriTTGAAAAGAATAGTATT'rTGGTTCAGTTGAa"ri"rCGATTTGGTTTAJaiATTl^^^^^ 

AATGGa'AGAAGAAATAATTTTTGATTAATATTATATCATCTACTGTATACGA'TTTAAAAAAAAAATTGATTTGGTTTGGT 

CTCGATATTTTGCTTGTAGCOTGGTTCAATATl^TTACTATATATATGATTCATGACTTTCATACTAATCAATA'rTTG'i'T 

CGGTTTCGTTTTGAOn'AGT-CTGAAAGTGAATCATATAGTTAAAATATTGACCAATCTATCATATACTTACACGCTACAAG 

Al'T'I'TCGA'I'GTA/iTTTTGGTGTAAGCA'TTTTGGTTTTTATATGTTCATCCTTAGTAACTAACAACAAATATCAATTGGGA 

;vGCATAACGTTATATAT-TCCCAT?-J>.GGCTATAACACTATAJ'-.TTATGCAGAATACG-3ATACTCAATGAAGATAACAGAGAA 

AAGCGACGTGTACAGCTATGGAGTCGTAGTG<?.TCGAGGTACTAACGGGTAAGCAACCAATCGATCCAACGA.TACCAGATG 

GACTCCACATAGTGGACTGGGTC;iAGAAAATCAGAGACATAC.ViGTAATCGACCAAGGACl\ACAAGCAAGACCAGAGTCA 

GAGGTTGAAGAGATGATGCAAACGCTAGGAG?CGCGCTTTTATGCATTAATCCAATA£CCGAAGACAGGCC'TACAATGAA 

AGATGTGGCTGCTATGCTTAGTGAGATATGTCAAGAGAGAGAGGAATCGATGAAAGTa'GATGGTTGCTCGGGAAGTTGTA 

ACAATGGAAGAGAACGTGGCAAAGATGATTCGACTTCATCGGTTATGCAACAAACGGCTAAGTATTTGAGAAGTAGTAGC 

ACGAGTTTCTCTGCGTCTTCTTTGCTTTACTCTTCTTCTTCTTCTGCTACTTCTAATGTTAGACCAAAT(rrTAAAT 
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Figure 6 SEQ ID NO 2 

C1CC T 1 ftcA. r r TA? ^ * ^ AAr >r'C'"CT'iPT'"C""Cn(.>USJiATTCHLrtA TATf r-? hJGATAC 
81 AAAGATAGAAA-CAGAGTTATTGCATGGAAGCTGAMuVrCA TACAGCATGGTGGATATAACC2«iTGAGACACAAAACAATC 
1 6 1 ACGGCftTGTGTTATAGAACAACACT'TATC AT-CTGTATTAJi CATAAAGCAAAAGTTCCTATACGACATAGTCCAAJiGTTAC 
241 AGACCATAZIGCACACCTTATGATOTGTTGTACCCTCTAAA ACACAGGATGATGACAAGAAGCAAATO^T GCCCAAATTTAC 

3 2 1 TGAGGGAAAAACGAGGGCTTAACGCTCTCATATCTCAi^iAC Aaj'iAACTTCTCAAACAGTTTCATGTG'fTCAGATTGGAGGG 

4 0 i C AWGGCAACGGATAAGCTGCCATCATTAGTAGGAG'rTGG TAGCACAAfiAGACAAACCCTCGTATGGAATTCCACCAGGT 
481 CCCATAAAGATAGGACGACCCCAACCAAAGTCTGCATCAT AAATAGGrAATCTAACCCAGCTTGTGATTCCCAAATTTGG 
551 GCACTTGTAGGTATGTGCACCGCGGACAftGGGCTGACAGA TCAGGCTGC ATCTCCAG6TAGTCAAGAGCTGACCTCAGAT 
541 AGTTATCATGCATACGAACCAAGAAATCATGAATC-TGTCC TGCAGCATACCATGTTGGCTTAGArAACAAATCCCCTGCA 
721 ACAGCCAATGGTGTTGCAGTGAATATC ACATTCCCAAAGT A&CCAGGAGGCAGCTGCGGACGTAGTCTAGAGCTTCCATC 
801 AGTTGCAATGTACAGTTTCGTCTCTTGGTCGTTTGGAAGC CCTCGCGCCTTTCCCACTGATCTCCACACATGCCCTGCCA 
881 ACATCTCGTAT'SftGCTG'PftGCTGACAGTGTTCCCATCCTC CTTGGATTTCGCCTTAAGAGCAACAAGCTGGTCTCGTGTT 
951 AATTTG AATATAGAGACAGTGGTATTCTG AGGTCC-TGATT TAGACGGATCAAQAGGTATCTTCATACTTGGTGCAGGCTG 
1041 ATATTCAACftTGATGAAAAGCAGGCTGTGGCGGGTCCCTA GCTCGGAGGAGTGTTCGATCAATGAAAGGTQGAATGGTTA 
1121 GGTCAAGACCACGAGCCATATCAGACCATGTGTTGATAAA ATGAAGACCAGAG^>A?vCCA2'CTGCCGCGTGATGTa'GCA-TC 
12 01 CCAACCCC AAGTGAAGCTCCCCCACAT7TAAAGAA&GTCA CCTATATCATCAAJi.CA?iTATCATTACATftACTAA?.AACAG 
1281 A^CATACATCATCA/iAGAATATCATTAC AGi'LACTAASAAA G AGAACAACAC AAGCATTCAATAAATTAAAAA,\GGGAAAG 
1361 ACTTTAAATTAAGCTGACC-TGCAAAACG AGAAGCGGGAAA GAGTGAATGCCAGCGGAQTGATCAACTTCGGGAATAAGCT 
1441 GACGGAGAri'iUiGGGTAGGAGCAAAATCACeAAiiA'DC ATC GATAACAGAAGGAGTATCAGCCACAACGAAGAGAftCACCA 
1521 GCACCGTTACAATCGA-TCTCAATACGACCATCATCGTCTC TOTTCAAGCGACCAGCCATAGGGTAi^iAAAGGGACAAGGGC 
1601 TTTGG AAAG AGCTTCC-T-TCATGACCTGAGGGTCAAAGAAA TTGGAAGCGCCGGT3GGTCTGTAGAAGTAGACACTAGGGG 
1681 T ATGG AATCTGGGGATGACAAGG'rCGACGll'GGAGT-TCCA AAGATTAGTGATTGGTGTCrcGGTSQCAGGCCGGACCATG 
1761 GTGGAATCTCTGATGTTAATTTTCATATTTTTAAGAAftGA TCAAACGAACAAACAAAaTTTGTTTTGAGTT-TGATCTGAG 
1841 AAATTGTGTGAGAAGCGAGGAGGGGTAGATATMG'rGii'GT STGAAC^AAWlAGGAAAGGTCGT/iATCCTTCACCTACCGC 
1921 CCC ACGACCCCGCCTACTCCATCTGTCGACCAACTCTCTC AGAAGGTTTATCATTGGTCCATCTCATCACTGGGTCCT AC 
2001 riTTCTATCCGGGGTTGGTAAACCCGTTCCCTCTGTCCTA AGGCCCAAATAATACTA-\TGGGCl''rA/\ATAGTATi>JUiGTT 
2081 CGAATGAAGA»i<:4AAGG AAAAAAAATTGACAAGTTGATTGT TATAGCTACCGTACAAGT•TAGTCAATCGftATGCC?.AAG^^T 
2161 IXJGcm'^ACTTAACTCATTACCACGTAGCAG AGCATAACA ACAACC ACCA^iTAACATCTAACGGCGTAAAC'TCACGTGTT 
2 2 4 1 AAG AGAA'CTACG'i-CAGAATCCAAGAAAGTCTTTT-TTTCAG TACACCACAAATAAATTATGTTGAflTT'TATATCCGAACTA 

lACIfI'VTTXT irr\l\^^ip C fl 1 \ AV irA "^CT ■> ] IIA [1 AfL CIA! 1 IlGiT 
2401 GCGGATAGCTTTTCTTrTGCTTTCTGTTCTAA^ATTATTTC T-TAGGCATi'ATT-rGATGATCCAACTTGGTTTGAGGATCTA 
2 4 S 1 AT AAA.Ai»/n'CAA'riTCTTTGAAGCATTCA.TTGGCTTAATG ATCCAAACCTTTTGTCCTCCCACTCTCTAAGAAAATAAAG 
2561 CA.iVAAGCAAA/iCTAAA AGflAA AGCAACTAGTACCQTGGGT TGi37C^^ATGGTAACLXATGCTACGAa/>TACCAVrCA'U^GC 
2641 TTAAATGACAA TTTAG TACCTTGG GTTGGT CATGATTTAG AG CGGAACA AATATACCATACATCAftACGAGGATATACAn; 
272 •'A 1 G'L'^ T Itj "'^T" pi^r\ P rTlC T C TL^ CGG TA^AA ^ ? LG C CAl CfarTL^ TI\cC 

28 1 1 C^TTCTC OJZ^ ATJ" ^ ^Trt^^^ r^ CT\"" U u AfiA? ViVT'^TTTAC T'-"tti.T_ 

88 A I , r tffi A A AT TTAAACTTCT^ ^ ^ 1 T PAC rfAGr'"j fe&CA 

2 C vn. 1 A TT A^"" At J2L„ ^ 'l A J.l, r cr IlGG \<j I " \.\ t A.i,TCAlA CTCA 

141 IT T r/^ ■» ] AAA ■^^ fl_„G '^■T i,_3TA "^i. T T'" ] T^T"" CA P^^ ^h\C "'■"'JlL.'^iUV. 

^ J _±_->-'''r^ TT^TrT T TAP" \A C TTI L. ^ irTLJ> i^P \ \ 1 TT^A^xTrG^CA 

2 1 fCT fC A ^ 1 1 1 r ^ T TTT ^ AArTT T T- A''"n^^AT TAA C""'"''AC '"Atj 

Jzai A/lACAGAAC_CCC GAGGTAAAAG AAAAAGACAGAGCACATG AAGTrrAGlACTTTlATATATTTjWrATATCA TTCTTTCT 
3361 TATTGCrrATCTCTAAAGCAAAAACTTCC CTA? U^CCCT A A GCCAj^GGA CTC AGATC GATG CAGAACCA A GAAGGCTTGT 
3441 TTTGGAa'TfGAGAGCCAAATGCAAAGAAAAAAACTCTTa T GTCTTTGCAAATGCCGATT 
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Figure 8 SEQ ID NO 3 : 3960 bp 



ATGTCTTTGCAAATGCCGaTTCCAAGiiAAAAAAGCCTTAA CTGTTTCTCATTTT-TCCATTACRTTGTCTCTGTTTTTAGC 
--ITCTTCATCTCCTCGACCTCTGCATCAACCAATGAAGTC TCAGCTTTGATTTCTTGGCTTCACAGCTCGAACTCGCCGC 

ACCGTCAGTCTTCTCCGGCTGGAATCCTTCTGATTCTGA CCCATGTCAATGGCCTTACATTACTTGTTCCTCCTCAGAC 
•■ •vCA.AACTCGTTACAGAGATCAATGTCGTCTCTGT-TCAGT TAGCTTTACCTTTCCCTCCTAACATTTCTTCGTTTACT-TC 
A -rTCAGAAACTCGTTATCTCCAACACTAATCTCACCGGA GCTATATCTTCTGAGATCGGAGATTGTTCGGAGCTTATAG 
r.-ATCGATTTAAGCTCAAATAGT'CTTGTTGGTGAAATACe IVCGAGTCTAGGGAAGCTCAAGAATCTTCAAGAACTTTGT 
r: AAACTCCAATGGCCTCACAGGAAAGATCCCACCGGJUiC TCGGGGACTGCGTTAGCCTCAAGAATCTTGAGATT'rTCGA 
TA.ACTACTTATCCGAGAATCTCCCGTTGGAGCTCGGAAAG ATCTCGACTCTTGAGAGCATAAGAGCCGGAGGAAACTCAG 
.VJCTTTCAGGGAAGATCCCGGAGGAGATCGGAAACTGTAG GAATCTCAAGGTCITAGGGITAGCAGCSJiCGAAAATATCC 
CGTTCTTTACCTGTTTCGTTGGGTCAACTAAGCAAGCTCC AGTCTTTATCTGTCTA'n'CTACAATGCTCTCAGGTGAGAT 
■ ■CCTAAAGAGCTTGGAAACTGCTCTGAACTTATCAATCTG TTTCTATACGACAATGACTTGTCCGGCACGCTTCCAAAAG 
-KA-CTAGGAAAGCTTCAAAACCTAGAGAAGATGCTTCTATG GCAGAATAiATCTCCACGGACCTATCCCTGAGGAGA'rTGGA 
rrCATGAAAAGCTTAAACGCCATTGATCTCTCTATGAACT ATTTCTCAGGAACCATCCCTAAATCGTTCGGTAACTTGTC 

.AATCTCCAAGAGCTTATGCTTAGCAGCAACAACATCACC GGGTCAATACCTTCGATTCTA_AGTAACTGCACAAAQCTTG 
■.-IX-AGTTCCAGAT'rGACGCCAATCAGATaTCAGGTTTGAT TCCACCAGAGATTCGATTGCTTAAGGAGCTCAACATCTTC 
■r-!>^ATGGCAGAATAAGCTAGAAGGGA_ATArrCCGGACG AGTTAGCTGGTTGTCAGAATCTa-CAAGCTCTTGACTTGTC 
iu-AGAATTATCTCACCGGATCATTACCTGCAGGTTTGTTT CAGCT-TCGTAATCTGACTAAGCTCTTGCTTATATCTAATG 

■'■ATCTCCGGTGTTATCCCACTGGAGATCGGGAACTGCAC T rCGCTTCrrCAGAWAAGGCTTGTCAATAACAGAATCACC 

i^AGAAATCCCT/iAAGGTATTGGATTTCTTCAGAACCTTA GCTTCCTCGACTTGTCTGAGAACAATCTCTCCGGTCCAGT 
r'-C-rTTGGAGATAAGTAATTGTAaACAACTCCAGAa-'GCIT MCaTGAGC.Aft,CAACACTCTTCAAGGTrATCTTCCTCTGT 
.--rTATCATCGTTAACAAAGCTTCAAGTACTTGATGTCTC TTCC?>J\TGACTTGACTGGCAAGATACCTGATAGTCTAGGT 

.-ATCTCATTTCACTCAACCGGCTCATTCTAAGTAAGAATT CTTTCAATGGAGAAATCCCTTCGTCTC-TCGGTCACTGCAC 
■...-kATCTTCAGCTTCTTGATCTCAGCAGTAATAACATCTCT GGAACTATACCGGAGGAGCTTTTTGACATCCAAGATCTAG 
• ATTGCGTTAAACTTGAGCTGGAATOCATTA6ATGGCTT TATCCCGGAAAGGATTTCCGCGCT-TAACCGATTATCCGTG 

--'-liATATTTCGCATAACATGCTTTCAGGCGACCT'rrcTG CGCTGTCCGGTCTAGAAAACTl'GGTTTCTCTGAACATCTC 
r ■lATAACAGATTCTCAGGTTATCTTCCAGATAGTAAAGTG TTTAGACAGCTGATAGGAGCAGAGATGGAAGGAAACAATG 
-AC^TCTGTTCCAAAGGTTTCAGGTCTTGCTTTGTAAGTAA CAGTTCACAGTTAACTACACAGCGTGGTGTGCACTCACAT 
^..i.vrrCAGGATAGCCATTGGATTGCTAATCAGCGTGACAG CGGTTCTAGCGGTATTAGGCGTGTTGGCGGTTATACGAGC 
TAAACAAATGATTCGAGATGATAACGATTCAGAGACTGGA GAAAATCTAaXSGACATGGCAATTCACACCTTTTCAGAAAC 
r^AACTTCACAGTCGAACAI'GTACTCAAGTGTTTGGTAGA AGGTAATGTTATAGGGAAAGGTTGCTCCGGGATAGTGTAC 
A^AGCTGAAATGCCTAACAGAGAAGTCATCGCQGTGAAAA. AGCTCTGGCCAG'rGACAGTACCTAATCTGA;iTGAGAAAAC 
TAAGTCATCAGGAGTTCGAGAC'TCATTCTCAGCTGAAGTA AAAACACTTGGATCGATCAGACACAAGAACATTGTACGCT 

r-ri-GGGATGTTGTTGGAy\CAAGAACACTAaftC'rTCTTA-r GTATGATTATATGTCAAATGGGAGTITGGGAAGTTTGCTT 

■acgafwiggagcggtgtatgtagcttaggatggg/^agtga ggtacaaqattatact'tggtgcagctcagggtttggctta 
:rr:-gcaccatgactgtg'i'tcctccca':ctgttcatagagac atcaaggcaaataatattctgattggccctgatiti'gaac 
.-.-rracattggagatttcggactcgctaagcttgltgatga tggcgactttgctcgttcttccaacaccattgctggttcc 
t atggtoacatagctc caggta cttcccaaatttctgtto tgctgctaattagttaacttgatacatcataagttactag 
t agtcttagcctagtaatggtagaaattcaggtttatagt taxsgttto^&tam'tggc atto?gaaaagaatagtatttt 
ocrrrcagttgattttcgatttggtttaaaa'mtagaaaa aaaatcgtagaagaaataatttttgattafttattatatca 
tctactgtatacgatttaaaaaaaaaattgatttggtttg gtctcgatattttgcttgtagcttggttcaatatttttac 
tatatatatgattcatgactttcatactaatcaatatt-tg rtcggtttcgttttgattagtctgaaagtgaatcatatag 
rraaaatattgaccaatgtatcatatacttacacqctaca agattttcgatgtaattttggtgtaagcattttggttttt 
atatgttcatccttagtaactaacaacaaatatcaattgg gaagcataacgttatatattcccataaggctataacacta 
-■ aattat gcagaat acggatactcaatgaagataacagag aaaagcgacgtgtaca6ctatgga6tcgtagtgctcgagg 
tactaacgggtaagcaaccaatcgatccaacgataccaga tggactccacatagtggactgggtcaagaaaatcagagac 
atacaagtaatcgaccaaggactacaagcaagaccagagt cagaggttgaagasatgatgcaaacgctaggagtcgcgct 
tttatgcattaatccaatacccgaagacaggcctacaatg aaagatgtggctgctatgcttagtgagatatgtcaagaga 
< ;agaggaatcgatgaaagttgatggttgctcgggaagttg taacaatggaagagaacgtggcaaagatcattcgacttca 
tcggttatgcaacaaacggctaagtatttgagaagtagta gcacgagtttctctgcgtcrrctttgctttactcttcttc 
rrcttctgctacttctaatgotagaccaaatcttaaataa 
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Figure 9 



1 >5SLi;gpi PKKKALTVSHFSITLSLFLAFFI SSTSASTNEV SALl SWLHSSNSPPPSVFSGWNPSDSDPCQWPYITCSSPD 

81 NKt.VTEINWSVQLALPFPPNISSFTSLQKI,VISN'rN]:.TG AISSEIGDCSELIVIDLSSNSLVGEIPSSLGKLKNLQELC 

161 LNSNGLTGKIPPELGDCVSIJCNI.EIFmn.SENLPI£U3K ISTi:.ESIRAGGNSELSGKIPEEIGNCKNLKVI.GI,AATKIS 

241 GSLPVSLGQLSKLQSDFVYSTMLSGEIPKELGMCSELINL FLYDNDLSGTLPKELGKLQNLEKMLLWQNNLHGPIPEEIG 

321 FMKSLNAIDLS»mFSGTIPKSFGNL£OT.QEIjmSSNNIT GSIPSILSDCTia:.VQFQIDJajQISGLIPPEIGLLKEI.NIF 

401 LGWQNKLE6Iin:PDELAGCQNLQALDLSQNYr.TGSI.PAGLF QLmJLTKLLLlSNAISGVIPLETGNCTSLVRIiRLVNNRIT 

4 81 GEIPKGIGFLQNLSFDDLSENNLSGPVPI.E1SNCRQLQML NLSNNTLQGYLPLSI.SSLTKLQVLDVSSNDLTGKIPDSLG 

561 HLISLNRLILSKNSFNGEIPSSLGHCTNLQLLDLSSNNIS GTIPEELFDIQDLDIALNLSWNSI.DGFIPERISALNRLSV 

641 LDISHNMLSCDLSALSGLENLVSLNISHNRFSGYLPDSKV FRQLIGAEMBGIilNGLCSKGFRSCFVS^^SSQLTTQRGyHSH 

721 RLRI AIGLLI SsrTAVLAVI.G'/LAVTRAKQKlRDDNDSmG EIS.VMQFT'PFQKtNF'I'^/EHVLXCL^/EOTjlGKGCSGIW 

801 IC^EMPl^EVIAX.' Kja.WPVTlfp^aiT^KTKSSg'/RDSFSaE V KTLGSXPJ^a^I'/RPLGCC»iK>ITRI;L^!YBWSS^^GSI;GSLL 

881 HERSGV'CSLGWEV lSYia:iLg&fi.GGW-^ .l,HHDCVggf^^ IKM^tTILtGPDFEPYlGDFGIJC-ajVDDGDFARSSNTIAGS 

961 YGYIAPEYGYSMKITEKSDVYSyGWVLEVLTGKQPIDPT IPDGLHXVX)WVKKIRDIQVTDQGLQARPESEVEE^SMQTLG 

1041 VALLCINPIPEDRPTMKDVAAMLSEICQEREESMKVDGCS GSCNNGRERGKDDSTSSVMQQTAKYLaSSSTSFSASSLLY 

1121 SSSSSATSNVRPNLK 



wo 02/46439 



10/40 



PCT/EPOl/14154 



Figure 10-1 

CLVl 1 majrorllkthl . . .IfrnlvlffsDcfayt.dir.evllnlkssm 38 

I -i Mill 1! 1:. i 

RCHl 1 MSLQMPTPRKXM:.TySHFSITLSL-PLAFFISSTSAS™EX/SAL-ISWIiHSS 5 0 

39 iCTDkghglhdwihssspdahcsf sgvscdddarvislnv. . 77 

1 IN : i |. i :|i 

5 1 NSPPPSVFSGWNPSDSDPCQWPYIT-CSSPDNKLVTEINV^/SVQLALPFPP 10 0 

78 . . .sftp. ........ . Ifgtispeigialthlvnltlaanaf tgelplem 114 

III I I II III . h : l-.j- Ihl : 

101 NISSFTSLQKLVISNTNLTGAISSEIGDCSELIVIDLSSNSLVGEIPSSL 150 

115 ksltslkvlnisnngnltgtfpgeilkamvdlevldtyimnfngklppem 164 

I -I - I :--ll III III M- :-l . II h 

151 GKLl^NLQELCLNSm.hTGKIPPE.hGDCVSLmLBlFDNTLSENLPLEL 198 

165 selkklkylsfggnf .fsgeipesygdiqsleylglngaglsgkspafls 213 

-■■ I- ^ IN IMII I- Hi 0 1 I I 

199 GKISTLESIRAGGNSELSGKlPEEIGHCRlvTLKVLGLAATKISGSLPVSLG 248 

214 rlknlremyigyynsytggvprefggltkleildiRasctltgeiptslsn 2 63 

.1 i. ::: | .| | -.1 | : \.\ : \ | 

249 QLSKLQSLW.YSTMLSGEIPKELGNCSELraLFLYDHDLSGTLPKELGK 297 

264 Ikhlhtlf Ihinnltghippelsglvslksldlsinqltgeipqsf inlg 313 

I--I : I III i III-- : II ■••MM ■IIMIII 
298 LQin^EKMLLWQSranjHGPIPEEIGFiyKSMAIDLSMlSryFSGTIPKSFG]^ 347 

314 nitlinlfrnnlygqipe aigelpkl 339 

|: M MM II 11 I -I 

348 NLQELMLSSNNITGSIPSILSDCTKLVQFQIDANQISGIiIPPEIGLLPCEL 397 

340 evfevwermf tlqlpanlgrngnlikldvsdnhltglipkdlcrgeklem 389 

:i hi H I !l !M hill :| I - ! 

398 NIFLGWQKTKLEGNIPDELAGCQNLQALDLSQSTYLTGSLPAGLFQLROTjTK 447 

390 lilsrinf f fgp.ipeelgkcksltkirivknllngt;\/paglfnlplvtiie 439 

h! -I i li M Ml -hi I M 1 •• - 

448 L.LLISNAISGVIPI>ETGNCTSL\^RLVNr\iRITGEIPKGIGFLQNLSFLD 497 

440 ltdnf f sgelpvtrasg.dvldaiylsnnwf sgeipoaignfpnlqtlf Id 488 

h:| li -h •! I : MM Ml ■: ■ MM 

498 LSE]Sn^LSGPVPLEISNCRQLQ]yni,NLSNNTLQGYLPLSLSSLTKLQVLDVS 547 

489 rnrf rgnipreif elklilsrintsanijirxigix^dsisrcsclisvdlsm 538 

I ill ^ I hh M- i M hi -III I 

548 S&n3LTGKIPDSLGHIiISLNTJ:jILSi<I>jSFNGEI?SSLGHCTM.QLLDLSSN 597 
539 ringeiDkgixmvknla.tlnisgngltgsiDtaignintslttldlsfnd 587 

M U- ■■ ■■■•■\ MM I li II \' ■■ h Ihi I 

598 NISGTIPEELFDIQDLDIALNLSWNSLDGFIPERISALNRLSVLDISHNM 647 
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Figure 1 0-2 

588 Isgi-v.plgg qf Ivfne . . , tsfagntylcl 614 

Ml ■ M -I II ■■ W W 

648 LSGDLSALSGLElS!ljVSLNISHmFSGYL.PDSICv?RQI.IGAEMEGNNGLCS 697 
615 phrvsc ptrrjaqtsdhjihtalf spsriv. itviaaitglilisva 658 

1! i l-l ■ - H : h I- : i 

698 KGFRSCFVSKSSQL.TTQRGWISHRLRIAIGLLISVTAVLAyLGVLAVIRA 747 
659 . . .irqrmkktaigksl.awkltafqkldfksedvleclkeeniigkggag 704 

II I ■ --I i- I IIIM I ll-ll I hllll -I 

748 KQMIRDDiroSETGENLWTWQFTPFQKLNFTVEHVLKCL.VEGNVIGKGCSG 797 

705 ivyrgsmpnnvdvaikrl vgrgtgrsd. .hgftaeiqtlgrir 745 

III: III H:|:| ■ I I l-ih-li| II 

7 98 IWKJVEMPlTOEVIAVKKLWPVWPNIJSrEKTKSSGVTiDSFSAE^ 847 

7^6 hrhivrllgyvankdtnlllyeympngslgellhgskg.ghlqwetrhrv 794 

|:-MI 11 IM Ihhll Mill 111 I 1 M !■■-■-■ 

H48 HKNIWELGCCWNKNTRLLMYDYl^SNGSLGSLLHERSGVCSLGWEVRYKI 897 

7 95 aveaakqlcylhhdcsplilhrdvksnnilldsdf eahvadf glakf Ivd 844 

. IIMI "111111 I l-MhIMMh ill - Mini . I 

8 98 ILGAAQGIjAYIjHKDCVPFIVI-IRDIKAIJMILIGPDFEPYIGDFGLAKLVDD 947 
845 gaasecmssiagsygyiapeyaytlkvdeksdv^/sfgwlleliagkkpv 894 

I . ..iMMMiiiii i-M: mnihiii-ii-: ll-h 

948 GDPAR.SSNTIAGSYGYIAPEYGYS1SJK.ITEKSDVYSYGVWLEVLTGKQPI 997 

895 g.efgegvdivrwvmteeeitgpsdaaiwaivdprltgypltsv. . .i 940 

:|. Mil:: : :M I I • I • 

998 DPTIPDGLHIVDWVKKIRD IQVIDQGLQARPESEVEEMM 1036 

941 hvf kiammcveeeaaarptinrewhmltnppksvanliaf 980 

:|::|: ||||::| 11- - 

1037 QTLGVALLCINPIPEDRPTMKDVAAMLSEICQEREESMKVDGCSGSCKING 1086 
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Figure 11 
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Figure 12 




RPS4.pro 



RPP5.pro 



wo 02/46439 



14/40 



PCT/EPOl/14154 



Figure 13 SEQ ID NO 18 : 842 bp 



AAGCTTAAATGACA.ATTTAGTACCTTGGGTTGGTCATGATTTAGAGCGGAACAAATATACCAT 

ACATCAAACGAGGATATACAGAGAAA.ATTCATGGAAGTATGG.AATTTAGAGGACAATTTCTC 

TTCTGGGCTACAACGGACCGGCCCATTCGCTCATTTACCCAGAGGTATCGAGrn'GTGGACTTT 

TGATGCCGCTAGAGACTATTGGCATCGGATTGA.AAAAAATGTTTACTTCGTTGTT.AAC,AAT'rTT 

CTGAATGCAATATTTTCCTTGTCATGAATATTTAAACTTGTTATTACTTTCTTTTAGCITAGGTG 

TGGACAATTATGGAGTTTACTTCAAACGAGG.AAGAATCTTAAACGCTCGGTTCAGGTCTCGAA 

AACAAACCAACTCACAATCCTGACTTAATTGAGGAAA.ACAATGCAA.AACCACATGCATGCTT 

CCATATTTCTATCATAATCTTATAAGAAAAAACACTACTAAGTGAAATGATTCTGTATATATAT 

AACCAATGCCTTTTGTTTTGTGATATTTTATGTATATATAACTATTGACTTTTGTCATCTATGGA 

TAGTGTCTCGGGCTCTTGGCAAACATATTTCAAAGAA.AAGTTAATGACTGTAATTAATTAATC 

TGAAGCTAGAAACAGAACCCCGAGGTAAAAGAA.AAAGACAGAGCACATGAAGTTTAGTACTT 

TTATATATTT.AATATATCATTCTTTCTTATTGCTTATCTCTAAAGCAAAAACTTCCCTAAACCCT 

AAGCCAAAGGACTCAGATCGATGCAGAACCAAGAAGGCTTGTTTTGGATTTGAGAGCCAAAT 

GCAAAGAAAAAAACTCTT 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



JOURNAL 
MEDLINS 
REFERENCE 
AUTHORS 
TITLE 
JOURHAL 



AB017061 61712 bp DNA PLN 09-AUG-2000 

Arabidopsis thaliana genomic DNA, chromosome 5, TM! clone :K19B20. 
AB017061 

AB017061.1 61:3510337 

Arabidopsis thaliana (strain: Columbia) DNA, clone_lib: Mitsui TAG 

clone :K19E20. 
Arabidopsis thaliana 

Eukaryota; Viridiplantae; Embryophyta; Tracheophyta ; Spermatophy ta ; 
Magnoliophyta ; eudi cotyledons; core eudicots; Rosidae; eurosids ll; 
Brassicales; Brassicaceae; Arabidopsis. 

1 (sites) 

Kaneko , T . , Katoh ,T. . Sato , S . , Nakamura ,Y . , Asamizu , E . . Kotani . H . , 
Miya j ima , N . and Tabata , S . 

Structural analysis of Arabidopsis thaliana chromosome 5. IX. 
Sequence features of the regions of 1,011,550 bp covered by 
seventeen PI and TAG clones 
DNA Res. 6 (3), 183-195 (1999) 

993.97451 

2 (bases 1 to 61712) 
Nakamura. , Y . 

Direct Subisiission 

Subrnitted (26-AUG-1998) to the DDBJ/EMBL/GenBank databases. 
Yasukazu Kfakativura, Kazusa DNA Research Institute, Department of 
Plant Gene Research; 1532-3, Yana, Kisar^azu, Chiba 292-0812. Japan 
(E-mail: ynakamu@kazusa.or.jp, Tel: 81-438-52-3935, 
Fax: 81-438-52-3934) 

Address for correspondence: kaos@kazusa.or.jp 

For the latest information on annotation of this clone, please see 

http : / /wvjw , kazusa . or . jp/kaos/cgi-bin/agd_graph . cgi?c=K19E20 
Genes with similarity to proteins in the databases are described in 
'product' or 'note' qualifiers. Genes that have no significant 
protein similarity are described as 'unknown protein' . 
The software programs used to predict genes include: Grail 
(Informatics Group, Oak Ridge National Laboratory, 
http: //conipbio.ornl.gov/Grail-l.S/) , 

GENSCAN (Chris Surge, MIT, http://CCR-081.mit.edu/GENSCAN.html). 
■NetGene2 (S.M. Kebsgaard, et al . , CBS, Technical University of 
J Denmark, http://www.cbs.dtu.dk/seri/ices/NetGene2/) and 

SplicePredictor (Volker Brendel, Stanford University, 

http : / /gremlinl .zool . iastate . edu/cgi-bin/sp . cgi) . 

Genes encoding tRNAs are predicted by tRN.a.scan-SS 
{Sean Eddy, Washington University School of Medicine, St. Louis, 

http: / /genome .wustl . edu/eddy/tRN.^scan-SE/ J , 

This sequence may not be the entire insert of this clone. It may be 
shorter because we remove overlaps between neighboring submissions . 
The 5' clone is K24G6 and the 3' clone is K20J1. 
Location/Qualifiers 
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source 1.. 51712 

/orgahisin= " Arabidopsis thaliana " 

/ strain= "Columbia" 

/db„xref taxbn: 3702" 

/ chrQniOSQme= " 5 " 

/clone="Kl9E20" 

/clone_iib= "Mitsui TAG" 
CPS joinfl745. .1821,3560. .4202.4287. .4390,4470. .4634, 

4781. .4973} 

/note= " gene„id : K19E20 . 1 » 
/codon_3tart=l 
/ evidence=not„experi!nental 
/product="pectate lyase" 
/protein id='' BABl0313 .1 " 
/ db_xre f = " GI : 1 0 1 7 7 1 7 9 " 
. / translatioii= " MTSRS^TETvlJHEH^AVENPEEVAAM'^.^3?4SIImSTARRKLGYFSCST 
GNPIDDCWRCDRRWQSRRKHJ^'CAIGFGRNAIGGRDGRYYWSDPNDDNPVNPKPGT 
LRHAVIQEEPLTOVFKI«3MVITLKEELIMNSFKTIDGRG\««mi^ 
IIHGIHIHDCRPTGNAIWRSSPSHYGIITOTMaDGDGISIFGSSHIWIDHNSLSNCADGL 
IDAVMASTAITISimFTHHNEVMLLGHSDTYTRDKVMQ\rriAYNHFGEGLIQHMPRC 
RHGYFHVVNMiYTHWEMyAIGGSASPTINSQGNRYIAPFmRFAKEVLSRVRSTRQW^ 
TOfi\'RSEGDLFLNGAFFTRSGSGLGASyARASSLAAKSSSLVGVIT'mAGALNCRGGRR 
C" 

CDS compleinent<9033. .10373) 

/note= " gene_id : K19E2 0 . 2 " 
/codon„start=l 

/ evidence=not„e3cperiKie)atal 

/product=" selenium-binding protein-like" 

/protein_id== " BABl 0 314-1 " 

/ db„x r f J f = " G I : i Oi 7 7 1 8 0 " 

/translation^-'MNPTQTI.FSPGGNSPASSPASHP.S.SLFPQiMNCRTIRDLSQIHA 
WIKSGQMRDTLAAAEILRFCATSDLHHRDLDYAHKIFNQMPQRNCFSWNTirRGFSE 
SDEDKAI,IAITLFYEMMSDEFVEPNRPTFPSVl.KACAKTGKlQSGKQIHGLALKyGFG 
GDEFVMSNLVIU»5YVKCGFMKDARVLFYKNX X EKIDMVV>n'DRRKRDGEI VLWNVMIDGY 
MRiGDCKaARm.FDKMRQRSWSWirrMIS6YSIjNGFFKDAVEVFREMKKGDIRPI^^ 
LVSVLPAISRLGSLELGEIiajHLYAEDSGIRIDDVLGSAI.IDMySKCGIIEKAIHVFER 
LPRENVITWSAMINGFAIHGQAGDAIDCFCKMRQAGVKPSDVAYimjLTACSHGGLVE 
EGRRYFSQMVSVDGLEPRIEHYGCMVDIJ.GRSGIiLDEa^EEFILimPIKPDDVIWKAriL 
GACR^lQG:WEMGKRVMHLMDMV'PHDSGAYVALS^mASQGNWSEVSEMRLR!yK 
RKDPGCSLIDIDG\n^HEFWEDDSHPiaaCEINSl>n^VEISDKLBiAGYRPITTQVLLNL 
EEEDKENVI,HYHSEKIATAFGLISTSPGKPIRIVKNliRICEDCHSSIKLISKVYKRKI 
TVRDRKKFHHFQDGSCSCMDYW " 
CDS 11469.. 12086 

/note= " gene_id : K19E2 0 . 3 
unknown protein" 

/codon_start=l 

/ evideiace=not_experimental 

/protein id:^" BAB10315 . 1 " 

/db_xref ="61:10177181" 

/translation=''MAASVEYFPyYSPPSHQHPLPSPVPPPPSHISPPPPPFSPPHHP 
PPPHFSPPHQPPPSPYPHPHPPPPSPYPHPHQPPPPPHVLPPPPPTPAPGHHVimA/ 
ISLGSLFFLAPLAAALFCYLKKRRKSSTKAEIIEFDEHLKVQETIVQGPHGBQTRWM 
LEEDIHLVEDIKKTEKLSRPSHLSSTGRHAIDISDPNHHPTEQKS" 

CDS complement (join (12718. .13611,13749. .14156)) 

/note= " gene_id : K19E2 0. 4 " 

/codon_.start=l 

/ evidence=not„experiisiental 

/product="anthranilate N-benzoyl transferase" 
/protein id=" BAB10316 . 1 ° 
/db„xref="Gl: 10177182" 

/ translation= "MKINIRDSTMVRPATETPITNLV«^SNVDLVI PRFHTPSYYFYRp 
■TGASHFFDPQV■MKEALSKALVPFYP^1AGRLKRDDBGRIE1DCNGAGVLFWADTPSVI 
DDFGDFAPTLNLRQLIPEVBHSAGIHSFPLLVLQVTFFKCGGASLGVGMQHHAADGFS 
GLHFINTWSDMARGLDLTIPPFIDRTLIiRARDPPQPAFHHVEYQPAPSMKIPLDPSKS 
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GPEm'TVSIFKLTRDQLViUJKAKSKEDGNTVSYSSYEayrLAGHVVmSVGKARGLPiroQE 

Tia.YIATDGRSIUjRPQLPPGYFGWIFTATPIAVAGDLLSKPTlflnfAAGQIHDFLVBMD 

DNYLRS ALDYLEMQPDLSALVRGAHTYKC PNLGITSWVRLP I YDADFGWGRP IFMGPG 

GIPYEGLSFVLPSPTHDGSLSVAIALQSEHMKLFEKFLFEI" 

joizi{j.5849. .18529, 18605. .18747, 19300. .19808) 

/note="gene„id:K19E20.5" 

/codon_star!;=l 

/ evi dene e=iio t_e2jperiineiital 

/product^:" receptor protein kinase-like protein" 
.''proteiii_id= BABloaiT.l ' 
/db_xref="GI .-10177183" 

/ translation= "MSLQMPIPRKKALTVSHFSITLSLFLAFFISSTSASTNEVSALr 

SWLHSSWSPPPSVFSGWWPSDSDPCQWPYITCSSSDNKLVTEINVVSVQLALPFPPNI 

SSFTSLQKI,VISNTNLTGA.T.SSEIGDCSELIVIDL£3SNSLVGEIPSSLGKiKNLiQELC 

LNSHGLTGKJPPELGDO/SLKffljEIFDI'r/LSBtJLPLELGKXSTLESIRAGGNSELSGK 

IPEEIGNCRl:ffjIO/LGLA.ftJTKISGSLPVSLGQLSKLQSIiSWSTMLSGEIPKELGNCSE: 

LINLFLYDm3IiSGT?LPKELGICLQHbEKMjLWQI®}LHGPIPEEIGPMKSLNAlDLSl^ 

FSGTIPKSFGOTjSlSnJQEimSSliffirETGSIPSlLSNCTKIiVQFQIDANQISGLIPPEIG 

IJ:,KEIJSriFLGWQimiEGNIPDEIJ^Q]Sn:.QALDLSQNYI.TGSLPAGLFQLRlSnJTK^^ 

ISNAISGVrPLEIGNCTSLVRLRLVNNRITGEIPKGIGFLQNLSFLDLSENNLSGPVP 

LEISNCRQLQ]yn:m.SNOTLQGYl.PL,SLSSLTKLQVLDVSSNDLTGKlPDSLGHLISUSr 

RLILSKNSFNGEIPSSLGHCTm;QbLDI.SSI«aiSGTIPEEI.FDIQl)l.DIAIjr>3LSWI>JSi:. 

DGFIPERISAUmLSVX.DISm^lMbSGDLSALSGLEI^^VSI^MISHlSfRFSGYLPDSKVB'R 

QLIGAEMEGN1^LCSKGPRSCFVSNSSQI.TTQRGVHSHRLRIAIGLLISVTAVLAVLG 

VIAVIRAKQMIRDDNDSETGEtnJimJQFTPFQKLNFTVEHVLKC^^^ 

VYKAEMPKrREVIAVKKLWPVTWNIiIffiKTKSSG\^ IRHKNTVRFLG 

CCWnSKOTRLLMyDYMSNGSLGSLIJffi 

GIJUa.VDDGDFARSSNTIAGSYGYIAPEYGYSMKITEKSDVYSYGVVVLEVLTGKQPl 

DPTIPDGLHXVDWVKKIRDIQVlDQGLQARPESEVEEMMQTIiGVALLCINPIPEDRPT 

>!!KDViymiSEICQEREESMK?iraGCSGSCmiGRERGKDDSTSSVMQQTAKY^ 

SASSU^YSSSSSATSNVRPNLK" 

join(22264 . .22578, 23196 . .23354) 

/note="gb|AAD49765 .1 

gene_id:K19E20.6 

similar to unknown protein" 

/ codon..,s t a r t = 1 

/ evidence~not_experiraental 

/protein id=" BAB10318 . 1 " 

/db_xref="GI: 10177184" 

/ translation= "MDPKSPEFIIDQPLKILGFVFDELSATRVSGHLTLTEKCCQPFK 

VLHGGVSAI.IAEALASI.GAGIASGFKRVAGXHLSIHHLRPAALGE1VFAESFPVSVGK 

NIQVWEVRT.^'gT<AKK.TSTPDNKItIVSTSRVTLFCGLPIPDHV^APBELKKVT£5KL" 

join {25778. .2607 2 , 26283 . .26377,26531. .26550,26729. .26818, 

26916. .27031,27145. . 27244, 27447 27518 , 27597 . .27544, 

27726. .27803,27879. . 27998, 28105 .. 28210 , 28303 .. 28456 , 

28574. .28708,28845. .28975,29226. .29304,29399. .29446) 

/note=" contains similarity to 5 ' -nucleotidase 

gene_id:K19E20.B"' 

/ codon_3 tart=l 

/ evidence=not„experimental 

/protein„id= " BA310319.1 " 

/db_xref="GI:101771B5'' 

/ translation^ "MJJLCADQVFDKKOLIrmDMSTKMKFPSFSAAELPDLRKSNKRRGS 
LRMIKCRAAGADGGRVAVGDDVFSVTTSSKYEVDYLGQSTKGDLNLKLDPLQSFGDGQ 
ATLEGPIEEVARTEAQAAENLIRELGIQGPPSAQHSPRfiIFCSRTLNI,RSTSAIGYDM 

DYTLMH¥NVMfi.WEGKAYDyCMENLKSMGFPVDGLAFDPELVIRGLMIDKEKGmVKM 

RFGYWRAMHGTKMLSNKAVSEIYGRELVDLRMQSRl^raFLNTFFSVSEALAYAQWDR 

LDDGFISADLGTLDYKGLYKAVAKALFRAHVEGQLKSEIMSKPELPVEPDPELPLALL 

DQKEAGKKiLLXTNSDYHYTDKMKKHSFimFL-PNDE4DWPJDLFDM\flVSARKPEFFQ54S 

KPLYEWTGEGLMRPCFKAETGGLYSGGSAQMIESSIiI>JVHGBEILYVGDHIYTDVSVS 

KVHL,RWRTAL1CRELEEEYMALIGSRGHREELIEL1NQKEWGDLPNQLRLAX,QRRSK 

GRPAQTL.AATNLDDQELTETMQKLLIVMQRLDDKIGLMLETDGELFNKRYADIYTSRV 

SHFL,NYTPF^r/FRSQEQSLAHDSPLPDAGIEW" 

join (30274. .3 0620,3 0707. .32633) 
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/note="gene_id:K19E20.9" 

/codon_start=l 

/evidence=not_experimental 

/product="inutator-like transposase-like protein" 
/protein id=" B.AB10320 . 1 " 
/db_xref="GI:1017718S" 

/translation="MaTKKVIAICQSGGEFVTNKDGSLAySGGDAYAIDIDQDTCMSD 
FKSELAENFGFSWEamTLKYFLPGISKKTLITISKI)KDFQRWSFSM)APIWEIFVIiPE 
EAEARWSNMPASRSSRTTASEAWPWSAGDIWGDDMIANDFQIEMGIMPEESSPL 
PCOTVLTDEKQHIKJUiQQWENAITGVDQRFNSFTEFRDALHKySIAHGFTYKYKKiros 

HRVS WCKAQGCPIimiTASRLSTTQLICIKKMITRHTCERAWKAGyRASRGWU'GS 1 1 
KBKLKAFPDYKPKDIAEDIKREYGIQLtOYSQAWRAKEIAREQLQGSYKKAYSQLPSFC 
KKIREmPGSIAIPMTKEDSSFHRLFISFYASISGFRQGCRPLLFLDTADLMSKYQGV' 
^^JVATAPBAEDGlFPVAPAWDAETBDCTmlrFLEHLKLALADPRTITFVADFQNGLKT 
ALPLVFEKQHHHAYCLRHLAEKLNMDLQAQFSHEARRFILNDFYAAAYATOPDAYYRS 
^,BNIKSISPDAYTWIESEPLHWA^IAIiFEGERYNHMNSIFGLDFYSWVSEAHELPI•^Q 
MIDEIiRAKLMQSIYTHQVQSREWIVSTLTPTNEEKLQKEIELARSLQVSAPHNSLFEV 
HGETINLVDINQCDCDCKVWRLTOr.PCSHAVAWECIGKSPYEYCSRYFTSESYRLTY 
AESIlSrPVPKTTMT^I^XItiEEPPVEG\^'/SWPPPTRLTPPGRPKSKQVSPLDMFKRQLQC 
SNCKGLGHNKKTCKAVS " 

complement(join<33092. .33310,33890. .34039.34118 . .34270, 
34394. .34521,34741. . 34852. 34989 . .35105, 35298 . .35438) ) 
/note="gene_id:K19E20.10" 
/codon_start=l 

/evidence=riot„experiii)ent:al 

/product= "mitochondrial carrier protein-like" 
/protein_id- " BAB10321.1 " 
/db_pcref="GI : 10177187 • 

/translation="MIPTTATVDDEPGQIKRALIDASAGAISGGVSRSVTSPIiDVIKl 
RFQVQIiEPTTSWGLWGNLSGASKYTGMVQATKDIFREEGFRGFWRGNVPALLMVMPY 
TSIQFTVLHKLKSFASGSTKTEDHIHLSPYLSFVSGALAGCAATLGSYPFDLLRTII.A 
SQGEPKVYPTMRSAFVIJIIQSRGIRGLVNGLTPTLVEIVPYAGLQFGTYCMPKRWK^ 
VmYKLSSKIPINVDTmiSSFQriFICGI^GTSAKLVCHPLDWKKRFQIEGLQRHPR 
YGARVERRAYRMMLDGLRQIMISEGWHGLYKGIVPSTVKAAPAGAVTFVAYEFTSDWL 



3.67B8. .37987 

/note="gene_id: K19E20 . 11 
piri IT09376 

strong similarity to unknown protein" 

/codon_start=l 

/ evidence=not_experimental 

/protein_id=° BAB10322 .1 " 

/ db_xref = GI : 10 17 7 1 8 8 " 

/translation="MADSQRLSTASGVKDGQPPWKiaaCIiSNDTTSNPSLPYDVILIIL 
ARVSRSYYTNLSLVSKSFRSILTSPELYKTRTLLGKTENFLYVCLRPPDEANPRWFTL 

YRKPNQTI,TDHTTKK?a?KKKKKEEKSSVNLLAPISlLNSHPVEl/fSAIISVDHYLYAIS 

ADIEIC»PYSfTVPYLDCRTOT;\'NEAPRMRI^\HTSSEFEGI^/YLPGSFSSPDSLNO^ 

NTI4TQTVKPVPPEKRt4FKLENLEKKIYYKSFHLDSRAGKGLSLSYKSKHLTCGl,VVLD 

TVDSYLRSSCMIENIAYFYRKGNFIWRGLDGKILVYGKIEGLEGLPKF.9RYSSVQLAE 

YGGKLVVLVTOKYVPASGYKEKMIWCAEISLEKRNGKEIWGNVEWFDAVLTVPKSYKIL 

CATAATL" 

38404. .39522 

/no te= " gene_id : K19B2 0 . 12 
pirjlT09375 

.■strong similarity to unknown protein" 
/codon_start=l 
/evidence=not_experimental 
/protein_id=: " BAB10323.1 " 
/db_xref-"GI:10177189» 

/ translation= ''MSSPEKKRKKHQKSSPNPSL.PSDLIVSIL.ARVSRSYYTNLSWS 
KTFRSILTSPELYKTRTLLGKPETFLWCLRFPDEANPRWLILYRKPMQTLTKKKKKK 
EDSSWLrAPIPILNSPTVSWSSLVAVGSYLYAITADIKDSPCSW/fyLDCRTHTVJI,D 
SPPiRLAHINSDFNGRTYFPGSSEKPDSLNCVEVYTSiTNTQTVJNPVPPQKRKLKFGms 
GKIYIPPCQENHRKAVALNPKVLTl'gEAVGLGTNLDRGSFCMIGNIAYCYDPSGKFRWM 
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NCNTAEGDWSRLEGLEGLPKFARYSTVKLAEYGGKLVVLWDKYVAASGYKEKMIWCAE 
ISLEKRNSEEIWGKVEWFDAVLTVPKSYKFVCAKSATV" 
CDS 40591.. 41703 

/note="gene_id:K19E20 . 13 
pir| |T09371 

strong similarity to unknown protein" 

/codon_start=l 
/ evidenGe==not_experimental 
/ pr o te in„i d= " BA B103 24.1" 
/dij_xref= " GI : 10177190 •■• 

/ translation= "MSSPERKRKKRSLEPSPESTPNPSLPDDLIVSILARVSRLYYPI 
LSLVSKSSRTLVTSPELYKTRSFFNRTESCLYVCLDFPPDPNPRWFTr.YRKPNQNITE 
KTKNSSGFVLAPIPNHHSHSSSIVAIGSNIYAIGGSIENAPSSKVSILDCRSHTWHEA 

PSMimKPNYPAMfVVDGKiyV^JSGLEEFDSSKMaErW'DIKTQTWEFVLSPI^^ 
SLVIEGEiyiFGDK^/i7TYKPKEDRWGGVGEHQSMDLGLFFHSYCVIDNVI.YCYRPGGI 
KWyESEKRSVWKLRQLKGLSiajASS{>TaADYGGK>mLLWDXYIPCSGNKSHSISCAV 
VSLERCKHQGXRGKVEWFDDMLTVPSSYWFVGALAATL " 
CDS join(42528. . 42595 . 42932 . .43054, 43183 .. 43265 , 43625 . .43734. 

43820. .44100) 
/note^"gene_id-.K19E20 .14 
unknown protein" 
/codon_start=l 
/ evidence-no t„experiniental 
/protein id=° BAB1Q325 . 1 " 
/db_xref= "GI: 10177191- 

/ trans la t i on= " MASNSEAGGSADYETLMSTSDVELLKRAWRNEKAAPEILQYEGA 
LVDRAKEQIELVEETIEDYVENGIDPLWSLYQMDLDRAQFLLRSYLRVRIjLKIEKFM 
PHJSOiNSEEAERRLSEQEKVFATRCaJDDiAKHFEETVLLKLPENYQSVLKQSLISEVD 
MVARWFV£iVMTTLVHr.,QCRSRIWIPLLFVEARTl.SPSIYTKSEHSLIPPYTYIYTQS 
FCNLSVGLPI>I.TLAEERAPRLWRWSAVISTSYVTR1." 
CDS join (47405 . .47596. 47798 . .47863 . 48017 . . 48125 , 48257 . . 48403 . 

48519. .48622.48732. .48830,48921. ,48990.49159. .49255, 

49331. .49385,49512. .49595,49681. .50009,50129- .50301, 

50385. .50521,50576. .50747) 

/no t e= * gene_id : K19 E2 0 . 1 7 " 

/codon_start=l 

/ evidence=not_exper iroental 

/product= " arginine methyl transferase- like protein" 
/protein_id= •• BAB10326.1" 
/db_xref=»Gl: 10177 192" 

/ translation= "MEIPSUSfKCKJEFTLASVTDLTSPSSSLSSSPWATFSCVNEVKE 

r.HFQESKSSDGPSF0r.SSTQLPKI.GPLQPTCVSDGSISSAKEKSSFSRGWIKFRDEK 

DSKEFCDSFEECKKDD.^VKQGSMjPNGTWSiU3KSKFDDKIEAA£AKMY?'HYYGQLLH 

QQKnsILQDYVRTGTYHAA\mENRSDFSGRVVVDVGAGSGir.SMF.VlLAGAKHVYAVEAS 

EMAEYARKLIAGNPLLABRlTVIKGKIEDIELPEKADVIilSEPMGTLLVIffiRJS^ 

lARDRFLSPNGKMFPTVGRlHMAPFADEFLFVEMJ^lin^ 

HQGYFSQPVVDAFDPRLLVAPSMFHVIDFTMMTVKLFPTi:.DSVPRCFLKIEDTFLSCY 
FYVQSSLFSEWHDPNSITVKFFSQEEQFYEIDIPLKFTASVCTRIHGLACWFDVLFBG 

SWQRWFTmPGAPTTHWYQIRCVLSQPIHVMAGQEITGRLHIilAHSAQSYTINIjTLS 
.^Km'GPGAJIQGGILQTSSCKI.DL>KEPYYK^tSOPQVYPTQEPPAQSQDIHIHSDDLEEL 
SLLQQNANAQL" 

CDS complement {join{51155. . 51406 , 51952 . .52131 . 52447 . .52695, 

52806. .52879,53145. .53270,53408. .53572,53658. .53785, 
53900. .54028,54123. . 54254, 54347 54494 , 54771 . .54907, 
54997 . .55283,55380. .55498,55559. .55611,55791. .55935, 
56245. .56352,56467. .56535,56693. .56805,56925. .56994, 
57106. .57207,57282. .57436.57516. .57540) ) 
/note== " gene_id : K19E2 0 . 1 B " 
/codon_start=l 
/evidence=not_.experin\ental 
/product=" isoleucyl-tRNA synthetase" 
/protein id=" BAB10327 . 1 " 
/ db„xref="GI: 10177 193" 

/translation="R\.saCKASEGEKQEDGKYKQTVDLPKTGFGMRANSLTREPELQKI, 
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WEENQWKRVSDiSilSnsrGGSFILHDGPPyANGDLHMGHMJNKILKDIINRYKLLQOT 
WPGWDCHGLPIELKVIiQSLDQEVRKELTPLKLRAKAAKFAKATVKTQMESFKRFGVW 

ADVMNPYLTLDPEYEAAQ1EVFGQE4AL.KGYIYRGRKPVIMSPSSRTAL,AEAE1.EYPEG 
HTSKSIYAIFKI^VGGAKTSLTJ3EFIP^^YLAVWTTPWTMPAKftAVAWAKLQYSVVE 
VQSFKQQKLFViyATDL\rpALEAKWGVKLSISKTFLGSD3:.ENCRYTHPIDNRDCPWr 
GGDYITTESGTGLTOTAPGHGQEDYATGLKYGLPLVSPVTJDEGKFTEEAGQFRGLSVL 
GEGNTAVVSYLDEJ^lviSLWEESYAHKYPYDWRTKKPTIFRATEQWFASVEGFRTAT^ID 
AimVKmmQAVhlRl S AMTS SRSDWC ISRQRTWGVP I P AFYHVKTKEPLMWEETI NH 
VKSIISQKGSDAW£MSVEDLLPEimiDKAADYF>KGTDTHDVViFDSGSSWAGVI.GKRE 
GLSFPADVYLEGTDQHRGWFQSSLLTSIATQGKAPYSAVITHGFVLDEKGMKMSKSLG 
)WVDPRLVIEGGKKrSKDAPAYGADVMRLWVSSVDyTGDVI.IGPQILRQMSDIYRKLRG 
TLRYLLGNLHDWRVDNAVPYQDLPIIDQHALFQLENVVKinQECYElSrfQ^ 
RFTIVDLSNFYFDIAKDRLYTGGTSSFTRRSCQTVLSTHDLSILRVTAPIVPHLAEDV 
WQNI.PFEYR35!EDGSAAEFVFELKWPTLNEQVinJSFPAEDVL.FWQRL 
AHl^Kf4IGSSLEAKVYLHTADAGMAAKrj:,EMSEAKNEADTLQRIFITSQVEV^ 
EMISSVQHTGEYVEGENKWlGVSRAiaGSKCERCWNYSGQVGSFSDHPTLCGRCFSVr 
VANPPEPAVAAVNSLA" 
CI .% complement (58740. .59315) 

/note=" contains similarity to disease resistance response 
protein 

gene_id-. K19E20 . 19 " 

/ codon„£3tart=l 

/ ev idence=not„experimentaI 

/protein_id=" BAB10328.1 " 

/dbjcref="GI: 10177194" 

/ trans lation= ■' MSiaiILILTAQrLLIiTATMjAGKNGEDFARTINRKHI.GLGKKEK. 
LTHLRVyWHDIVTGRNPSSIRIQGPVAKYSSSSYFGSITMIDMALTLDVPINSTVVGQ 

AQGMYVG^AQKEIGLLMAMISniJSiFKTGKYNGSTITlLGRm^ 
FARGYVEARTKLFDMKTGDATVBSNCYILHY" 

c;n; 5.9933 . .60445 

/note="gene_id:K19E20.20 

unknovm protein" 

/codon_start=l 

/ evidence=not_experimental 

/protein.id^" BAB1Q329 .1 " 
/db_xref="GI: 10177195" 

/traEiSlatiori="MPRVNIGPDmaQIEITTLQSQSKSRKLKKIKMASGSERVVVVG 
VDDSAHSYHALETMiDriFFIPFKANPRFKI.WLHARPTATFFLGVAGPGTVDI I PMVE 

EDLMKTADLVKKKC AEVCSAKSVEISSLEVIEGDPRNIJynJEAVERHHA " 
i.rxf^jK join (complement {AB023028. 1 :1 . .96) , 

complement (61652. .61712) ) 

/note~"gene_id:Kl9E20.21" 

/product="U4 snRWA" 

/ evidence=not__experimental 
•.•.>tJ?fT 19394 a 11344 c 11242 g 19732 t 

1 cgatcaacat gactaatctt ccacgagact agaaccacta acccacctcc tctagataaa 
«»S Qctaaacaat tctctacaaa aaatttccat taaatgtttc taattgtaat tgaaaacatc 
;J1 aacttaaaat attattaaat aaattttgta taaacgtgat tcattttaac ttttttttta 
isi atgaattcta aatccaaatg tgtcttaatc tactgctatc acaactccac aatattagaa 
;u acaaaacata tagagcttgg gccattttct gcttttatga ttttcacgaa gtgttfctcac 
)0l caatatactt acatcactag ttcagttgaa tctacttaaa atcataaggt acgtaactca 
iiji aatcacatta acgattcaag ttcattttca tggaattaga tattttagta aaattactac 
i:i ctattgatga atttggatat ttctagtttt aaaacgggag gaaaaaatga ttaacgtctt 
tH! tcatagcact atataaatgt taatttttat ttgaattaaa aatattcagt taacgaaagt 
'. U gtttgtgatt aatctagagt ttatttggag aaatagtcaa aaaaaagttt ataatacctt 
hO\ ttt.cctggtt catttttttt tttttttgcc aacatttaaa ttccaattgt tgagatttta 
tfai gcccatcatt taatggtgag aagaatgaaa gtgctaaact atttgtctta aatgattata 
7::i ctttgattat aaaacfcgtga caatatgatt attatacata ctgttaaatt ttattctatg 
.'81 tgatacatat ataaatgttt tagcaagtac aaaacatgga aaatgtgagc gacttacaat 
341 tataacatca attctcatta gcatttccgc ttttactgaa aacaaaaacg agtgagtaca 
901 atttgtcttc cctttttttt ttttttggac acaaagtaca atttatcttc tatttcctcc 
^^6! tthttgtgta atttgtctct tgtcgtatca cctacatttt cagcattgga tgttacaaat 
1021 gcgaacaaac tataatgtaa aaaacatgtc agcagcatcg ggccatttca aactaCgtaa 



wo 02/46439 



21/40 



PCT/'EP01/14154 



JNCBI Sequence Viewer 



1081 ataatgaaga aattatatga tctattctaa ataattaatt aggccaaata ttattctaag 
1141 tgaaaaatgt cacgctcaat ttctcttata taataactcc ttcccttgaa gcctttcttt 
1201 aactcttttc atcttccttc ttccttcttc ttctttaaac acaaaaaacc gacaaaagcc 
1261 tctgtctcgg acagagacaa agaaacagaa cagagcaaga acaatggctg ttacacaaat 
1321 acttgtcgtc tttgcttctg cattactcct atcaatgttc ttcactggcg tcgattccac 
1381 aaggtatcga aatgtttttg agtcaaattt atacgtgatc tggttcatgg aacaaagcat 
1441 tgaagttctc acttctcaaa tctaagctaa aacaagaaaa ccctttttct tgatttgaat 
1501 tacttgatca tgattacatg gattttgtat tatagatcta aatatggagg gagttctttt 
1561 tattttattt ttttatcata gafctatgtag agatccatgt gatcaaacac aagtgtgtta 
1621 ttagagtcta ttgaatgata atgcaacatt tcaatgtgta ttagcagtga ttaagcgagg 
1681 tttggtatat tggttttata atttgagttg tgttttcggg tttaacfcaat gagttggtta 
1741 attaatgact tccaggtcta atgaaacatg gcatgaacat gcagttgaga atccagagga 
1801 agtagctgcc atggtggaca tgtgagtttt tctgttgctt aaattttgtg ttatatacta 
1861 tatatgaata ataatgattg ttgtttgttc tctaattaag tattttagca tctaatgcga 
1921 ttggttagtt agtaaaacaa tttaaaaaat acactaaact agcattactt gacgttttgg 
1981 ttggataaat tcatgtgatc ttcttttata catttttaac tactggagaa aaagaaaata 
2 041 aaaatcagaa aactggtgaa agttgtattt atgtggagca gggctcacat ggcacttgtg 
2101 Caagaatttt tatacatcct ggtcaaaagg tcaaaaggtc aaaatccata gggccaaaca 
2151 ccacacctcg attccatttc tttcatactt tttgttfcatg tfctttgtttt atattacctt 
2221 aattctcata atgataaaaa aaatttctct tatgcaaatt ttaactgttt ttaactaaaa 
2231 gtatgttctt ggttcccgcg cactgacttt ttcaacgaga aaatcttttg attatttaga 
2341 atttttcgtt tgccgtagaa aatgattgaa aactttgttg attgttctat tttatacagt 
2401 attatgtttc cacatfcfcatc aaatacaaaa cgaaaaccta aataggttat aaaagtttga 
2461 ttttcactta tagaaacatt cccatcggag ttattttttg gttcttccgt tttctttttt 
2521 gtgttttcaa gattgactac atttfcgactc cgtaagttat agaacttaat gccaacagac 
2531 tcgafctttcg atttgtct.tc gcttccatgt ttttaattaa attttttcgg catgaaacca 
2641 atcaatacat ttaaacaatt tgcaaatcat attgtaaaac ataaaaaaaa taaaaatcaa 
2701 ataagtagtg aggataagtt. ctttatgctt attcttactg aggggtttca gagttattgg 
27 SI tggaccagac atgaaacatg atgcataaca tggttcattt agttgagtaa gatttggatg 
2821 ggattcttca tttaatgcaa tgcacccaaa tcafcatagat gaataaattt atttttgtaa 
2881 aaaacattaa atattattgc tfcgcaaaaat cttatggagt tattgtagac attggtccaa 
2941 aaacacgtgt aatgttcttg tttttgtttt tgaagagaaa catgtgtaaa tgttccaaca 
3001 cctaatcttt aaaacagtga attgccatca gccccatgca tctccctcgt gtagtgaatt 
3061 gttctctgaa tgatttcact cgtatcttta gtcttatctc gttattactt ataactatac 
3121 tagt.act.t.t:a aagatctgtt: tggcacaagt tataatgtta tgactatatg attaaacgaa 
3181 tagtattttt tggattatta ttacccaaaa acatatacaa ctattataac gtcatttcta 
3241 agaaagttta gcagtgtttt attfcttattt ttggctttta gaaafccattg gattcttgat 
33 01 tgaataggaa tagtttgcgt aacaaagcaa gatttttttc aattttctta caatgcgtga 
33 61 attagaagca ttagaataat cccaccgacg aaactatttg tcaaatgaca cacattaaaa 
3421 gtggtccacc caattcctct gatttcggtt aatatagtaa ccatgtttat atttagtgat 
3481 taattctgtc ataattcaat ttagcttagt gatcatgtta aattaatctg gtttgactaa 
3541 accggttttg atggttcagg agcattcgaa acagcacagc gagaagaaga ttaggatact 
3601 tctcatgctc aaccggaaat ccaatcgatg attgttggcg ttgtgaccgg agatggcaat 

36 61 cccggcgaaa acatctagcc aattgcgcaa tcgggttcgg tcgcaatgca atcggcggta 
3721 gagacggtcg ttactacgta gtgagtgatc caaacgacga taacccggtt aaccctaaac 

37 81 cgggaacatt acgtcacgca gtaattcaag aagaaccatt atggatcgtc tttaaacgtg 
3841 acatggtcat aacattaaaa gaagaactaa tcatgaacag tttcaaaaca atcgatggtc 
3901 gtggcgtcaa cgttcacata gctaatggtg catgcatcac gattcagttc gtgactaaca 
3961 ttatcattca tgggatacat attcatgatt gtagaccaac tggtaacgct atggttagaa 
4021 gctctccttc gcattatggt tggagaacta tggctgatgg agatgggatt tcgattttcg 
40 81 ggtcgagtca tatttggatt gatcataatt cgctttcgaa ttgtgctgat. gggcttattg 
4141 atgctgttat ggcttctact gctattacta tctccaataa ttacttcaca caccacaatg 
42 01 aggtaagaat ttgtctcaca ttttatgttt tttggttgct tgaatcggaa gttagttttt 

42 61 tttatggtag ctctgtttta tgacaggtta tgttattggg acatagtgat acctatacaa 

43 21 gggataaggt tatgcaagtt acaatagctt ataatcattt tggtgaaggt cttattcaga 
43 81 gaatgccgag gtacatactt tggcctaact tatgattctt ggcaaaaact tcgaaaagtc 
4441 gacaaaataa acatttttca ctaatgcagg tgtaggcatg gatattttca tgtagttaac 
45 01 aatgactata cacattggga aatgtatgcg atcggtggca gtgcgagtcc taccattaac 

45 61 agccaaggaa atagata'c.ct; cgccccaaga aaccggtttg ctaaagaggt actttcaaga 

46 21 gttaggtcga ctaggtcaaa taaagaaccg atctttcttg acttttgact tagtgtcaaa 

46 81 ctataaccct tatacttata gattacagtt tcaacaaaat tcttacagca aaaacttatg 

47 41 ggatgtttca ggtgacaaag agagactafcg cagggcaatg gcaatggagg cattggaatt 

48 01 ggagatcaga aggagatctt tttttaaacg gagctttctt cacacgttct ggatcaggtc 
48 61 ttggagctag ctacgctaga gcttcaagtt tagcagccaa atcatcatct cttgttggcg 
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4921 tcattaccta caatgccggt gctcttaatt gcagaggtgg tcgccggtgt taatagttac 
4981 taaagtctat cttcttttgg tttcttgcat tatacatggc aatagcacta ttgaatgtcg 
5041 aggcttgatt taaaattgcc tcaactgtafc tataaagttg taatgatttg tttcaccgcc 

5101 aattattgat gtatctataa aagcaaacaa agttggaact ctccctttac tctctcatct 
5161 taatgttttc ctacacaagg ataaaagaaa catagaaatt tgtcaaaaaa aaaaaaaata 
5221 gcagttctct cggaaatgca atgcagtatc aataatattg gtgaagttta tcafccttggc 
5281 atgtgactat ggtttttgga tcagtttggt tcggttatat cgcctatgat tgtttttgtt 
5341 cttggttagc tttggttaag agtttcaaaa ccaaatttaa ccgattttct tattccggct 
5401 aaattggtta tttttggctg aagttggttfc tattttccgg ttaaattggt tcaataaaac 
5461 tctagtaaca gttaaaagaa gtctaaaaat ttggtgaaaa cgggtctcag tttggtataa 
5521 tctggttatt aaaacaaaaa tacctaagga cggttcgatc cgaaccgaac caaaattttc 
5581 cgtttagcga tgcaaaatta aagttcttcc tagaggttca accgatagaa ctgaaaatct 
5641 tgattcaatc cggttcggtt ttctcggttg ggttatttat ccattggctg aaatcccgca 
5701 taacgggtcg tcttattatg taggctaact acaagttggt cattcaaggt taaacctgac 
5761 caacccgtag agtccgctaa ccgggatatg attaagatgg gctgaccaag actaaccccg 
5-821 tttggtcgag gataacatcc ttcataatga tggagaggtg tggggtatgg tgtcaaaggg 
5881 aataagaagg gatgtgaagg tggtggaacc acgtgagagg agacaagaac cacatgagag 
5941 catatgagag gaaataaaag aaaggacctt agtatcattg tcatgttatg ttgttgatat 
6001 tgataagata ggtttggcaa atggcttgct tctctttcat tacacacttt gtttcaccaa 
6061 atgttctctt tatgctaatc tctccttttt tattttctta atttgtcaca tttagcgtgt 
6121 ttttttttac agtcgcagag actgcttcta cggccacaaa attacactat tacggcaaga 
6181 aatttaacgc cgttgagtcg ctgcgattca gacgacaagt aaaaacagtt tttaaagttg 
6241 acgctgaaat ttccagcgat tacggaafcta ctgtttcttt ttctttttct gatcgctgcg 
6301 gcaactgcgg cactttattt ttcccaaata cacatcatct caggcttfctt atttgtgccg 
6361 cagttcttgc agctgcgact gttaaacgaa catgctaatt atcttaacta aatttaattt 
6421 gtttcttaag tgagtaagcg gagtaatttc ttaaacaatt tgaatagtag gttagataaa 
6481 attaaaattt aagtttagaa tttgatattc ttttttettt Egacctgaag tgtacacata 
6541 aCcccgtctt aaatagcatc atgaacacafc caacaatcca tctcaatttg ttttcgtggt 
6601 ct:cgcgaaag acgtacgtac aagtggctaa caattatatt aagcatcctt taaatacatt 
6661 aagaataata atcttatgtt acagaaat.ct aagacgt.aca gagggaatta tgttacatgc 
6721 tccggtttag aatatttcat cacaaaaaat atattttatt fctgtagattt atatatgtta 
6781 acatttcaaa agcatgtact ttaatctttt aacaatacat tgccgttttg aaaacaaaag 
6841 tcgaaagacg tcaaatagag aattttgacg tcatactact atgttactta ttccatgaag 
6901 aaaacataat atctaaatta gattttgttt tgaaaatttc gacatagtat tattgtccgt 
6961 acttggacgt ataccaaaaa ccacttgtat ttaaaaatca taaacaaaaa aaactcattt 
7021 atctatgcga acataacaca atatcaaata tgttagataa aaagtagtca acacfctttct 
7081 caccccttat attgtatgaa ttatgacata atatatagtg tttatagaat tggtatgaaa 
7141 ttatttggtc aacaaatata gtctccattt tttgatagtc aacactatca aaatttgtta 
7201 aaaaatcaat agtttttacc catatagttt acctaccaca ttaaggtttt ttcctcttgt 
7261 agtatataag ataaactatt tgacaacaat aaagtaaatt ttacatacaa cataagaaat 
7321 aatacgataa ttaataagtc ttgcaccatg tctacgaaat tattttctta tttcatgctt 
7381 ttggtagttc tcttctcagg taattaactt tttttcttct tcttcttata tgctcaacga 
7441 tctaatacgt caatgtacaa ttagctttga cattttgaca gaacttcttt atattttttt 
7501 cataggaagc taatgaacta atgccatcat ttttttcttc atacatattc afctttattta 
7561 aaagaagcat actaatttta atgataattc tgtatagttt tfcaatttaga tacatatcta 
7621 tgatacttta actcactatt tgttgcagta ttaacgatta ttectaaaac tgaggcacaa 
7681 aagagatgta gacaagaact agagccggga aagcaatgtg tgttggcaaa gtgtagggaa 
7741 Gtgtgtttca aacaattgaa aggattcggg agttgtattg agaagccacc aggctcttct 
7801 aagtatacat gtaattgttt ctacaattgt ggtccaccag gattttttta gaacgcaaga 
7861 taatgtaaac tcaatgaaac aatagtttaa ttttctatgg afctacaaaat gaaaatttat 
7921 ctttaaatta caaataatgt aatagtttaa caaaaaatga aaatttatct ttatcttaaa 
7981 attacaaaac ttttatactt gccatattta aggtttgttc ttctgatagt atattactat 
8041 attgtgcttt aaccattttt tgataaacgt gatgtagcca tctccttctt ctagatcttc 
8101 ctgcataaca attfcaatcac acaaaatcaa tttctacctc atcggattat cacagaccca 
8161 tgaacaaatg atacacatca aacccaaaat tggaagcata agacaagcta aagactaaga 
8221 agattggtac cctttgcgag atttgggacg agaaaatagc Cagagacgaa gaatctcctt 
8281 agacgaggag aatagatacq aggagatagc gagaaagtct aaatttcgtt tttccccaaa 
8341 tgtgtaaatt ttaaagat.at: tttccgtafcg agctagctca ttatgcccgc gaacaaaatt 
8401 ggtctttaga ccatatgatc cgttttagtt gcgggctgac cacagaccgt ccacaatatt 
8461 ttggactgag cttagcccat ggttggatga ccatttgaca- gttctactta ttaaagcgat 
8521 acagcttgta cagctttgac agttaccaat cacaaaaact ttatcaaaaa cgttacaaaa 
8581 agttttactc agagtttttt ttgtacaact tttttgtaca actttttcta acagctttgc 
8641 actgttccaa taaagctaac agctaaaaat cttgtacaat ttttggttac taatcagacc 
8701 cattatgtta gggttctccc tactttgaga aaaatcagcc caaatgtcta actctggtaa 
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K .t)! taaatatttg aatgagttta ttctactgta tafcattttac acacataaaa cttggtttct 
bSSl tataaatttg aagttttgat atttttttga ataaaaactt tgagacataa ttcatcattt 
"id3i accttttata ttacaattca tcaagatttt tattttatca catttctaaa ctactgaata 
aaagattcaa ttcttataaa tgtggagtca gtaaagaaac aacagagctt gtgtatacac 
agggaaagag tcttccgtct ttatgtttag tttcaccaat aatccataca cgagcatgag 
VCOl ccatcttgga aatggtgaaa acgtttccgg tctcgtaccg ttatttttct cttgtaaact 
'r. ttggagatga gfcttgatgga cgaatgacaa tcctcacaga tacgtaggtt cttaacaatg 
'>18i cgtattggtt tgccagggct cgtgcttatc aacccaaacg ctgtagcgat cttctcgctg 
9 241 tggtaatgaa gtacattttc tttatcttct tcctccaagt ttaacaaaac ctgtgtggtg 
9101 attggtctat acccagctag tctcagtttg tctgagattt ctacgagcat tgagttgatt 
9 361 tctttggctt ttgggtgtga atcatcttcc acaacaaact catgaagcac gccatcaatg 
9421 tcgatcaaac tacacccagg gtcttttctt atatccttct ccttcattct cagcctcatc 
94 8 i tctgaaacct ctgaccagtt tccttgagaa gcatacatgt ttgagagggc gacataagct 
'■>">4l ccactgtcat gaggaaccat atccatcaga atattggcaa cgcgcttgcc catctcgaca 
'iiiQl tttcGctgca ttctgcaagc acccaaaagt gctttccata tcacatcatc tggttttata 
9fe&l ggcatattaa gtataaactc ctcagcctca tcaagaagac ccgaacgacc caataaatca 
9721 accatacacc cgtagtgttc aatcctaggc tctaaaccat ccacactcac catctgactg 
97 8 5 iiaatatcUcc ttccttcttc taccaacccg ccatggctac aagcagtcaa aagattaatg 
984 1 taggcaacat cactaggtcg aactccagcc tgcctcafctt tacaaaaaca gtcaatagca 
0901 tcacctgcct ggccatgtat agcaaacccg ttaatcatcg cactccaagt aatcacgttc 
1 tctctciggta acctctcaaa tacatgaatc gctttctcaa tgatcccaca tttcgaatac 
10021 atatcaatca aagcagagcc aagcacatca tcaatcctaa ttccactatc ctctgcatac 
1008 1 aaatgcaacc attctcccaa ctctaaagac ccaagccgtg aaatagcagg aagaacactia 
10141 accaaagtaa cataattcgg tctaatatcc cctttcttca tctcacgaaa aacctcaaca 
10201 gcatctttga aaaacccgtt cagcgaataa cccgatatca tcgtattcca cgaaacaacg 
10241 cttctttgac gcattttatc aaacaacatc cgtgcagctt tacaatcacc aagtctcata 
10121 taaccatcaa tcataacatt ccacaaaact atttcaccat ctctctttct tctatcagtc 
10181 atcaccacca tatctttctc tataatgttc ttataaaaca acacacgagc atctttcata 
10441 aaaccacaca tcacatacat tctaacaaga ttactcatca caaattcatc accaccaaat 
lO'jOl ccatacttca aagccaaacc atgaatctgc ttaccttctt gaatcttccc agttt-t;cgca 
lObfcl caagccttaa gaacagaggg aaacgtaaac cgattcggtt caacaaactc atcactcatc 
10621 atctcgtaga ataatgtgat tgcgatcaga gctttgtctt cgtcactctc actgaaacct 
10681 ctgattatag tattccaaga gaagcaattt cgttgaggca fcttgattaaa gatcttatgt 
10741 gcgtagtcaa ggtcacggtg gtggagatcg gaggtagcac agaatcggag aatctctgcg 
10801 gcggcgagag tgtctcgcat ttgaccagat ttaatgaaaa cggcatggat ttgagagagg 
10861 tctcttatag ttcgacagtt gttgatttga ggaaagagtg aagatggatg actcgccgga 
1092 1 gaagacgccg gcgagttacc gccgggagag aaaagtgtct gtgttgggtt cattgagaat 
10981 caacaaagat tcctacgtgt ggctatgtaa gacgcataac aagatataat aaagatfcgtg 
11041 gtcgaaaacc gattaaaccg aaccgeiataa accgatttat tcagttatcg gcgctgaagc 
11 101 cgcaaggagg agttaaccga tcggcgaaat tattttggaa tttcggtctt tggatcggtt 
11161 cggttcggca actctgaacc gttttgttgt catatctaaa gatagttaga actctaaccg 
11221 aataaccgaa taatcttatg catttgcgca gacaaagaga tttgattatt tatgagacaa 
112S1 gaggaagcca agttgggact aaactaatgc aaagcttagc atgagagtca cgcaagcaga 
11)41 agcttagaag cacattccac aaaCccgacc tttacactac tactattact actacaaata 
11401 ggcttcaaca ttagagacta tgcacttcac ctttgcaaca accatttaag tacatacata 
1 : 4 b 1 ctcccaaaat ggctgcctct gtggaatact ttccctatta ctcaccaccc tcccatcaac 
I IS 2 1 atcctctccc ttctccagtt ccaccccctc cttcccacat ttcccctcca ccacctccat 
11S81 tttctcctcc tcatcaccca ccaccaccac atttttctcc tcctcatcag ccaccgccat 
11C41 caccatatcc tcatcctcat ccgccaccac catcaccata tcctcatccc catcagccac 
11701 cgcctccacc teacgttctt ccaccaccac caccaacacc agcaccgggg catcatgtta 
lllbl tcattgtggt tgttatctct cttggaagtt tgttttttct tgcattcctc gctgctgctc 
11821 tcttctgcta tctcaagaaa. agaagaaagt cbtctacaaa ggctgagatt atcgagtttg 
118S1 atgaacacct caaagtccaa gaaaccatag tgcaaggacc ccatggcgaa caaacgagag 
ll'>41 tggtaatgct tgaagaggac attcatttgg tagaggatat tcacaagact gaaaagctta 
U'OOl gccgtcctag tcacctcagt tcaactggaa gacatgccat tgatatttca gatcccaatc 
i'JGGl atcattttac tgagcaaaag tcctaaatca aaaggctcaa aatgtaaaca atcatattta 
12 12 1 ttcacaccat ctaactaata aaaacaattt aaatatggta ccaaatcttg caatccaaaa 
12 18 1 cttgatattg attccaagat aaatgctgta cahgtccgtc caattaatac aacttaatat 
i::24 1 tttactatga tttccacttt gcgaatgtaa gtaaccttat atatttatga tcattgattc 
12 JQ1 cgtttgctgg cttttaatgt agattcctag tgtactctgt -ttggtttctc aataacatfca 
12 361 ttccatttaa ctcgagtttc agatgtttct attaaataag taatgaaagc agctctaatt 
12421 ctcccaaaat tcacaagtat agaaagatag aaagatagaa acagagttat tgcatggaag 
12481 ctgaaaatca tacagcatgg tggatataac caatgagaca caaaacaatc acggcatgtg 
12541 ttatagaaca acacttatca tctgtattaa cataaagcaa aagttcctat acgacatagt 
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12601 ccaaagttac agaccataag cacaccttat gatttgttgt accctctaaa acacaggatg 
12661 atgacaagaa gcaaatttgc ccaaatttac tgagggaaaa acgagggctt aacgctctca 
12721 tatctcaaac aaaaacctct caaacagttt catgtgttca gattggaggg caatggcaac 
12781 ggataagctg ccatcattag taggacttgg tagcacaaaa gacaaaccct cgtatggaat 
12841 tccaccaggt cccataaaga taggacgacc ccaaccaaag tctgcatcat aaataggtaa 
12901 tctaacccag cttgtgattc ccaaatttgg gcacttgtag gtatgtgcac cgcggacaag 
12961 ggctgacaga tcaggctgca tctccaggta gtcaagagct gacctcagat agttatcatc 
13021 catacgaacc aagaaatcat gaatctgtcc tgcagcatac catgttggct tagataacaa 
13081 atcccctgca acagccaatg gtgttgcagt gaatatcaca ttcccaaagt aaccaggagg 
13141 cagctgcgga cgtagtctag accttccatc agttgcaatg tacagtttcg tctcttggtc 
13201 gtttggaagc cctcgcgcct ttcccactga tctccacaca tgccctgcca acatctcgta 
13261 tgagctgtag ctgacagtgt tcccatcctc ctfcggatttc gccttaagag caacaagctg 
13321 gtctcgtgtt aatttgaata tagagacagt ggtattctca ggtcctgatt tagacggatc 
13381 aagaggtatc ttcatacttg gtgcaggctg atattcaaca tgatgaaaag caggctgtgg 
13441 cgggtcccta gctcggagga gtgttcgatc aatgaaaggt ggaatggtta ggtcaagacc 
13501 acgagccata tcagaccatg tgttgataaa atgaagacca gagaaaccat ctgccgcgtg 
13561 atgttgcatc ccaaccccaa gtgaagctcc cccacattta aagaaagtca cctatatcat 
13 521 caaacaatat cattacataa ctaaaaacag aacatacatc atcaaagaat atcattacag 
13681 aactaaaaaa cagaacaaca caagcattca ataaattaaa aaagggaaag actttaaatt 
13741 aagctgacct gcaaaacgag aagcgggaaa gagtgaatgc cagcggagtg atcaacttcg 
13801 ggaataagct gacggagatt aagggtagga gcaaaatcac caaaatcatc gataacagaa 
13861 ggagtatcag ccacaacgaa gagaacacca gcaccgttac aatcgatctc aatacgacca 
13921 tcatcgtctc tcttcaagcg accagccata gggtaaaaag ggacaagggc tttggaaaga 
13981 gcttccttca tgacctgagg gtcaaagaaa ttggaagcgc cggtgggtct gtagaagtag 
14041 acactagggg tatggaatct ggggatgaca aggtcgacgt tggagttcca aagattagtg 
14101 attggtgtct cggtggcagg ccggaccatg gtggaatctc tgatgttaat tttcatattt 
14161 ttaagaaaga tcaaacgaac aaacaaattt tgttttgagt ttgafcctgag aaattgtgtg 
14221 agaagcgagg aggggtagat ataagtgggt gtgaacaaaa aaggaaaggt cgtaatcctt 
14231 cacctaccgc cccacgaccc cgcctactcc atctgtcgac caactctctc agaaggttta 
14341 tcattggtcc atctcatcac tcggtcctac ttttctatcc ggggttggta aacccgttcc 
14401 ctctgtccta aggcccaaat aatactaatg ggcttaaata gtataaagtb cgaatgaaga 
14461 agaaggaaaa aaaattgaca agttgattgt tatagctacc gtacaagtta gtcaatcgaa 
14521 tgccaaagtt tggcttaact taactcatta ccacgtagca gagcataaca acaaccacca 
14581 ataacatcta acggcgtaaa ctcacgtgtt aagagaatta cgtcagaatc caagaaagtc 
14641 tttttttcag tacaccacaa ataaattatg ttgaatttat atccgaacta tactatatta 
14701 tatgaaaaca tacctcaatt agtcataaat ttcacatttc atttt;iatt:t afctaactaat 
14761 ggttgttgtt gcggatagct tttctttgct ttctgttcta aaattatttc ttagccatta 
14821 tttgatgatc caacttggtt tgaggatcta ataaaaattc aatttctttg aagcattcat 
14881 tggcttaatg atccaaacct tttgtcctcc cactctctaa gaaaataaag caaaagcaaa 
14941 actaaaagaa aagcaactag taccgtgggt tggtcaatgg taacccatgc taccaaatac 
15001 caatcaaagc ttaaatgaca atttagtacc ttgggttggt catgatttag agcggaacaa 
15061 atataccata catcaaacga ggatatacag agaaaattca tggaagtatg gaatttagag 
15121 gacaatttct cttctgggct acaacggacc ggcccattcg ctcatttacc cagaggtatc 
15181 gagtttgtgg acttttgatg ccgctagaga ctattggcat cggattgaaa aaaatgttta 
15241 cttcgttgtt aacaattttc tgaatgcaat attctccttg tcatgaatat ttaaacttgt 
15301 tattactttc ttttagctta ggtgtggaca attatggagt ttacttcaaa cgaggaagaa 
15361 tcttaaacgc tcggttcagg tctcgaaaac aaaccaactc acaatcctga cttaattgag 
15421 gaaaacaatg caaaaccaca fcgcatgcttc catatttcta tcataatctt ataagaaaaa 
15481 acactactaa gtgaaatgat tctgtatata tataaccaat gccttttgtt ttgtgatatt 
15541 ttatgtatat ataactattg acttttgtca tctatggata gtgtctcggg ctcttggcaa 
15601 acatatttca aagaaaagtt aatgactgta attaattaat ctgaagctag aaacagaacc 
15661 ccgaggtaaa agaaaaagac agagcacatg aagtttagta cttttatata tttaatatat 
15721 cattctttct tattgcttat ctctaaagca aaaacttccc taaaccctaa gccaaaggac 
15781 tcagatcgat gcagaaccaa gaaggcttgt tttggatttg agagccaaat gcaaagaaaa 
15841 aaactcttat gtctttgcaa atgccgattc caagaaaaaa agccttaact gtttctcatt 
15901 tttccattac attgtctctg tttttagctt tcttcatctc ctcgacctct gcatcaacca 
15961 atgaagtctc agctttgatt t;cttggcttc acagctcgaa ctcgccgcca ccgtcagtct 
16021 tctccggctg gaatccttct gattctgacc catgtcaatg gccttacatt acttgttcct 
16081 cctcagacaa caaactcgtt acagagatca atgccgtctc tgttcagtta gctttacctt 
16141 tccctcctaa catttcttcg tttacttcac ttcagaaact cgttatctcc aacactaatc 
1S201 tcaccggagc tatatcttct gagatcggag attgttcgga gcttatagtt atcgatttaa 
162 51 gctcaaatag tcttgttggt gaaatacctb cgagtctagg gaagctcaag aatctccaag 
16321 aactttgttt aaactccaat ggcctcacag gaaagatccc accggaactc ggcgactgcg 
16381 ttagcctcaa gaatcttgag attttcgata actacttatc cgagaatctc ccgttggagc 
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16441 tcggaaagat ctcgactctt gagagcataa gagccggagg aaactcagag ctttcaggga 
16501 agatcccgga ggagatcgga aactgtagga atctcaaggt cttagggtta gcagcaacga 
16561' aaatatccgg ttctttacct gtttcgttgg gtcaactaag caagctccag tctttatctg 
16G21 tctattctac aatgctctca ggtgagatcc ctaaagagct tggaaactgc tctgaactta. 
16681 tcaatctgtt tctatacgac aatgacttgt ccggcacgct tccaaaagaa ctaggaaagc 
16741 ttcaaaacct agagaagatg cttctatggc agaataatct ccacggacct atccctgagg 
15801 agattggatt catgaaaagc ttaaacgcca ttgatctctc tatgaactat ttctcaggaa 
16861 ccatccctaa atcgttcggt aacttgtcga atctccaaga gcttatgctt agcagcaaca 
16921 acatcaccgg gtcaatacct tcgattctaa gtaactgcac aaagcttgtt cagttccaga 
16981 ttgacgccaa tcagatttca ggtttgattc caccagagat tggattgctt aaggagctca 
17 041 acatcttctt gggatggcag aataagctag aagggaatat tccggacgag ttagctggtt 
17101 gtcagaatct tcaagctctt gacttgtcac agaattatct caccggatca ttacctgcag 
17161 gtttgtttca gcttcgtaat ctgactaagc tcttgcttat atctaatgcc atctccggtg 
17221 ttatcccact ggagatcggg aactgcactt cgcttgtcag attaaggctt gtcaataaca 
17281 gaatcaccgg agaaatccct aaaggtattg gatttcttca gaaccttagc ttcctcgact 
17341 tgtctgagaa caatctctcc ggtccagttc ctttggagat aagtaattgt agacaactcc 
17401 agatgcttaa cttgagcaac aacactcttc aaggttatct tcctctgtct ttatcatcgt 
17 451 taacaaagct: tcaagtactt gatgtctctt ccaatgactt gactggcaag atacctgata 
17521 gtctaggtca tctcatttca ctcaaccggc tcattctaag taagaattct ttcaatggag 
17531 aaatcccttc gtctctcggt cactgcacga atcttcagct tcttgatctc agcagfcaata 
17641 acatctcCgg aactataccg gaggagcttt ttgacatcca agatctagac attgcgttaa 
17701 acttgagctg gaattcatta gatggcttta tcccggaaag gatttccgcg cttaaccgat 
17761 tatccgtgct cgatatctcg cataacatgc tttcaggcga cctttctgcg ctgtccggtc 
17 821 tagaaaactt ggtttctctg aacatcfcctc ataacagatt ctcaggttat cttccagata 
17881 gtaaagtgtt tagacagctg ataggagcag agatggaagg aaacaatgga ctctgttcca 
17941 aaggtttcag gtcttgcttt gtaagtaaca gttcacagtt aactacacag cgtggtgtgc 
18001 actcacatag actcaggata gccattggat tgctaatcag cgtgacagcg gttctagcgg 
18061 tattaggcgt gttggcggtt atacgagcta aacctaatgat tcgagatgat aacgattcag 
18121 agactggaga aaatctatgg acatggcaat tcacaccttt tcagaaactc aacttcacag 
18181 tcgaacatgt actcaagtgt ttggtagaag gtaatgttat agggaaaggt tgctccggga 
18241 tagtgtacaa agctgaaatg cctaacagag aagtcatcgc ggtgaaaaag ctctggccag 
18301 tgacagtacc taatctgaat gagaaaacta agtcatcagg agttcgagac tcattctcag 
18361 ctgaagtaaa aacacttgga tcgatcagac acaagaacat tgtacgcttc ttgggatgtt 
18421 gttggaacaa gaacactaga cttcttatgt atgattatat gtcaaatggg agtttgggaa 
18481 gtttgcttca cgaaaggagc ggtgtatgta gcttaggatg ggaagtgagg tacaagatita 
18541 tacttggtgc agctcagggt ttggcttact tgcaccatga ctgtgttcct cccattgttc 
18601 atagagacat caaggcaaat aatattctga ttggccctga ttttgaacct tacattggag 
18661 atttcggact cgctaagctt gttgatgatg gcgactttgc tcgttcttcc aacaccattg 
18721 ctggttccta tggttacata gctccaggta cttcccaaat ttctgttttg ctgctaatta 
18781 gttaacttga tacatcataa gttactagta gtcttagcct agtaatggta gaaattcagg 
18841 tttatagttt ggttttggtt tttggcattt tgaaaagaat agtattttgg ttcagttgat 
18901 tttcgatttg gtttaaaatt ttagaaaaaa aatggtagaa gaaataattt ttgattaata 
18951 ttatatcatc tactgtatac gatttaaaaa aaaaattgat ttggtttggt ctcgatattt 
19021 tgcttgtagc ttggttcaat atttttacta tatatatgat tcatgacttt catactaatc 
19081 aatatttgtt cggtttcgtt ttgattagtc tgaaagtgaa tcatatagtt aaaatattga 
19141 ccaatctatc atatacttac acgctacaag attttcgatg taattttggt gtaagcattt 
19201 tggtttttat at.gttcatcc ttagtaacta acaacaaata tcaattggga agcataacgt 
19261 tatatattcc cataaggcta taacactata attatgcaga atacggatac tcaatgaaga 
19321 taacagagaa aagcgacgtg tacagctatg gagtcgtagt gctcgaggta ctaacgggta 
19381 agcaaccaat cgatccaacg ataccagatg gactccacat agtggactgg gtcaagaaaa 
19441 tcagagacat acaagtaatc gaccaaggac tacaagcaag accagagtca gaggttgaag 
19501 agatgatgca aacgctagga gtcgcgcttt tatgcattaa tccaataccc gaagacaggc 
19561 ctacaatgaa agatgtggct gctatgctta gtgagatatg tcaagagaga gaggaatcga 
19 621 tgaaagttga tggttgctcg ggaagttgta acaatggaag agaacgtggc aaagatgatt 
19681 cgacttcatc ggttatgcaa caaacggcta agtatttgag aagtagtagc acgagtttct 
19741 Gtgcgtcttc tttgctttac tcttcttctt cttctgctac ttctaatgtt agaccaaatc 
19801 ttaaataaat gcatgggtaa ttatgttttt aagtgaacaa gtttctatgt ctaaagtttt 
19861 tgttttatct atatatgacg ttgagatgaa ctcggtgttt caaaccaaga aaaattaacg 
19921 gagactaatc tcgtcttacc ttttacattt tctcccgtca aggcgtcaag ttgatttttt. 
19981 tttgaaaatc aaataaagaa aaaactagtg aaaacagata" tgaggaggtc ataactcata 
2 0041 actgtgactc acgagataaa tcatgtatcg tggctgacgg tcacggaacc ttttgctttg 
20101 fcacccatttt ttcgaggttc gtccaggaag gcggataatt agtgtacctt cactatatac 
20161 tctataactt aataatttta tatcaattag atacaaatgt ttaattaaaa aataaaatag 
20221 caaaccctta accaagacca gtgtgfcatat taagtaacga aataaaattg aacagaaatt 
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202B1 gttgaccaga taaataataa gtatagattc atgttttagt ttttttttaa cttcaatttt 
2 0341 cagatttttt ttgtagtcta aagtttggtt gatgaaaata aaataaaata aaagaaaaat 
2 0401 ctatatttac ggaaatatat ctcttacaaa taattctttg caaatgtgga agacctatta 
2 0461 tacatataga catatatatt aacacaatca ttatgagtaa gtatcttaat attataaatt 
20521 gatgaaaatg ttaaaaaggt tataaatcta tatatatttg tctatttaaa aatggttgaa 
20581 aatcttctfct tcaaaaaatc aataattgtt tttatgaaac agtgaagctt aatctttaag 
2 0 641 aagtccctca aacaattttt accataaaat aagatttcaa attataacat acctaatagt 
20701 tttatctata tattctaact atccatgttc cr,atggtata ataatatctt ctaacattat 
207S1 ttttgaaact tgtttctctc tagttttttt tccctctcraa agcgagatcc aagtttattt 
20821 caagagttac caacataaac aatggcaaca aataaatttt tatctatttt gttgctttcg 
2 0881 ttgatggcgt ttgctgctat tctgctgcca atgatttcag gtaacacatc tattatctat 
20941 taaaatttgg tttgtaatgt tatgataaat aattctactt taaatatgaa agttaccgtg 
21001 tgtttctatg aaaaacaaaa cctaacctaa ttttttgttg aaggttttaa ttagctcatg 
21051 ggtgttaaac ataaactatc aaaaaatata accaaaaaaa taaagttttt aagttgtata 
21121 tttactttta tgtgaaaata tttctactcc actacgacaa aatgtcactt ttgtaggttt 
21181 tttgtgaagg tcacaaattt tagattacga gtgcaattag agtaacacaa atatttgatt 
21241 ttgaactaaa cagtcgtggt tttaaaaata gfcgfctcaaaa tcagaagaag aaaattgcac 
21301 tatgtagtaa gaaaacagtc gtttattgaa atfcgatcaafc ttttatttta gagtcgttta 
21351 atttgtttgt ttataaacta aacaggacaa atcataacat gtttaccggg cgaatgtaca 
21421 aacccatcgg aatgcaatgc a,gcfctgcaaa fccaaatggat acaaaggagg ggcttgtgta 
21481 agcatgagca taggctcaac aaccggtgct tgttgctgta aacctaattt caaatctcaa 
21541 gattctttca aatctaatga tatcattatt aataataatt agtacatcat tagatatatg 
21601 tattttgata tgaataaaaa tatattcaaa tatgtttcga tattgaacca tctataatat 
216Si ggataatcat tcgtaagtta tagtacgatc caaacatttg atcgfcatata ttaataaagc 
21721 atacgaat:gc tgaaatttat gtitaactata ctcatagaaa tttatttata tctgccatac 
21781 caatacaatt taaaagattt ttcaattaaa tgagagataa ataatttatt ttgattattt 
21841 ttttgt-ttct gttttgtcaa caatatgtaa tatgccgatt ctaaaaaaaa aatcatactc 
21901 tatatcatat attgtctaac cctctagtta gttcaaacaa cataagtaag ataacaagaa 
21961 ttaaaaataa atttaaatga ttttgatatc atatattgaa ttaaaatttc aaattctatc 
22021 aaaattcatg attaaattaa cacatccaaa tgccgaaaaa aaaaaaaaaa aaaattaaca 
22081 catccaaaaa- ttgtcacaaa tcgtactaaa ggcccagtaa gtcttttfcgt caataagtcc 
22141 attaagccca acaaacctct tcttctcttt cgttttatga catcaatcca tttgaagcaa 
22201 acaaatcaga cgtaaccgaa caaaaaaaaa aagttggttt gatcgatcgt taaaacctaa 
22261 gaaatggatc caaaatcgcc ggaatttatc attgatcagc cgcttaaaat cttaggattt 
22321 gtattcgatg agctctccgc caccagagtc tccggccacc tcaccctaac tgaaaaatgt 
22331 tgccagccgt tcaaagtctt gcacggcggc gtatcggctc tgatcgctga ggcactcgcc 
22441 agtctcggcg ccggaatcgc ctctggtttt aaacgcgtcg ctggaatcca tctctctatt 
22501 caccacctcc gtcctgctgc tctcggcgaa attgtcttcg ccgaatcttt tccagtctca 
22561 gtcggcaaaa atattcaggt aacctataca fcaacactgac caatacacat atatacgcag 
22621 ctatctttag actttaatgt atataaaatt atattaacta gaaaagtttt tagtaatatt 
22S81 aaattttaat attgtgtcaa agtaatgtaa aaataaattt tttttaaatg taaaaaaaaa 
22741 aattaatatt ttgttgaaga agttaaatat aattctgcaa tttgagagat aacataagat 
22801 aaattttcac gtattagtaa gtaaaagtga caagttatgt tttataacca ggtgattttt 
22861 aaaaaatgaa atttgttttg atgcaatatc gtgtgaaaag taagaaaact aatatatgca 
22921 ataccatatt taaattgtga accttattaa tttcacctaa aatgaaaatt ggaaaacttt 
22981 ttcatgtatt tgatagactt gtgagtttga attgtataaa tttccggcaa gcaaaagcag 
23041 ctcgtgtacc aagttgtcgg aaactttaat tttgttagca tgcattggac taaaccaacg 
23101 caaatatttt attttattta tcgtagactg aagacctaaa gagtaaaatg aa.tgtgaaag 
23161 tgacacgttg tgtggagfcat tttgtatcta actaggtgtg ggaggtccgg ttatggaaag 
23 221 ctaagaaaac cgagacaccg gacaacaaaa taatggtatc aacatcacgg gttactctct 
23281 tctgcggttt acctataccg gaccatgtta aagatgctcc agacgagctg aagaaagtta 
23341 tatccaaatt ataaattcaa attgtatcta tctcattttg ttaggaataa gaacaatttt 
23401 agtttggttt tattgtacga accaaacgtt ttaataaatt gttgtaccaa acatgagata 
23461 ttataaggtt ataaaaattg acaagttatt gtgggtttag tttgtagatc attttgaaat 
23521 atagtattaa tagactattt tgttgaaacg tccaaatcgt atagatgggg taacaaaact 
23581 aaactccttt atatgattct tttgcafcaaa attctgtctt tgtaataact gaatcataca 
23 641 tactttttta gtttgatacg cataatataa ctaccacaac tatgtagtgc atttaagata 
23701 tctttcttca tattataaat gtgtatcaat accatttaaa tcttcgtata caaagaaaaa 
23761 aaacaagatg gatattagtt bttcgatgga ga'cattattt ggttgagttc tttcttcttg 
23 821 agacaacaac atccagtgtg gagtctaaat gttccacaaf accatccatt tgtatacccg 
23881 tggcttaagc aaatctgatc acatacgctt gtacactcgg accgataaca catggcgtca 
23941 tgaacgtcaa tagcgtttat tttagctgta aaagatatca taaaaatcta ttatatatga 
24001 tagatcatca tttatgttta tatttagaga gctctacatt gagattcgat cacgatcaaa 
24061 cggtaaggtt acttcacgat aaactaacga agatagtgta ttttgctgaa tgatgtaaat 
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24121 tatattaaaa aaaaatataa ggtatatata taaacaaacg aaggtaatat ggtttagatc 
24181 aagaaaatac cctaatagaa atcaaaaagt ttgtgtatat gttttatgaa tttgtaaatt 
24241 gacttaccgg ccatcgcaaa caagagaaca attgttatgg cagfccagtat atgtgaatgt 
24301 ttctggttct gcattttttt gttttgaggt gtcactagct aatagctatt actttgtttt 
24351 ctctttgttt ttagatgtat ccattgagat taaagtgctc ttatataggg gtttggtttt 
2 4421 cgcagataaa attaatcaat ttagctggat tttattctat acccacaaaa taaatttata 
24481 aaagctaaga tataattttg atatgatggt tcatgaaagt gtcaactttg gttagattta 
24541 aacctgcata acgcattatg aaattgacta tatatatatt aagaaagaga taattctcat 
24501 ttaattttta gtaaaataat tactcaattt tatattatac cagaaacaaa gtgggttcaa 
24661 aaattattaa ggctatatca tatgtaaata caattaagaa acatttatgt aagaaaatta 
24721 gaaactgaaa tttcatatca agaattaaca aagaaaacat taataaaatt tcttgttacc 
24781 ttaaatatac cattttattt aaatgaacat taaatcatag tattttagat ttaagaaaaa 
24841 agaactacac agaaatttct ttaataaacc taatatatat gaaaataaat agaaagcact 
24901 actgaaagaa gatccatata aattaaagat agtttggtca atgccaatca tcattttgaa 
24961 ccaaatccac atttaaggaa aattgatgtt attctttaac ccgaaaacat gttcttgtct 
25021 ttgcacatgt gtctcgccct attttttatt tcttcattca acaagtcaaa atgaatctat 
25081 tttactacat gatctctata aacgaatgtc ataaagattt tatggccttc aaacttggtc 
25141 tcaacatcga tcatgtatat gattatctga ttcatccata tagaagaggg ctaacccgcc 
25201 ccttgtacct tgctttggcc ttaaaaaggt tttcaagaat ttactttctt tcttatttga 
25261 ttttaacttt gtaaatgtca catttttccg ttaattaatt ctttcgctga gtgacaaata 
25321 tgttaatatg acatttatca tcaagatata tattttatca tatttttgat acatgactat 
25381 caaacaaacg tcacactatt ggtcagagat tattcgacat aatccataag aagttgtaaa 
25441 tatatcatca agtgtgaata aatgtatata aatattatta aaaaaaaaaa ttagtagttg 
25501 Gtttttcgcg attcgtgcga atagattctg atgccaaatt tggatattat taagagcaca 
25561 tgt.ttct.cc!: tgtgtcgtgt cttttattta agagagaatc atcatctctc actctcafcct 
25621 tctagaagtt aaaacatcac gccatcttcc gaattcaaac gccgatgttc tgtcgttctc 
25681 tgtctcacat attggaacga gcgtaaagat gcctaatttc tcagttaacg ttccccaact 
25741 ctcatctctt tacagtacaa aaacgcccaa acftgagaatg aatctatgtg ccgatcaggt 
25801 gttcgataaa aagcttctgt ggagagatat gtcaacgaag atgaaatttc cttctttttc 
25861 tgctgcggaa ttacctgatt tgaggaaaag taacaagagg aggggatctc ttaggatgat 
25921 caagtgcaga gccgccggag ctgacggtgg acgcgtggct gttggggatg atgtgttttc 
2 5.981 ggttactact tcttctaagt atgaagttga ctatctgggt caaagtacta agggagattt 
26041 gaatctcaag cttgaccctc ttcagtcatt tggtatatat acaatcttcc tttttgtttt 
2 6101 gttagttttc gaaattcaga ttggttttag tgaaatgtgg gataaagttt tggtctttgg 
26161 ttagggaact tgattttggt tgatgtttgt ttaattgaga taaataagag ctaaaaagat 
26221 cgaacctttg ttgtgaatgt cacgttgtat gagcaaaatg taactttggt gtatctatgt 
26281 aggagatggg caggctacat tggagggtcc cattgaggag gtagcgagaa cagaggctca 
26341 agcggctgaa aatttgatta gagagttggg tatccaagtt cgttatctaa tccttacaac 
26401 tttgtgttat gtatttgaag tttgagtatc cctttgttgc gaaaacafcga atgtattgag 
26461 ttatttgtgt ctggttttat atggactttt ctttttcttt tgtgtcacat tctcatgcct 
26521 ggtattgcag ggccctttct ctgcacagca ctctcctcgg ggtatatttt gtagtcgtac 
2 6 581 attgaatctt cggtccatta gtgcaattgg atatgatatg gattacactt tgatgcacta 
26641 caatgtcatg gtgagttctt cctttctcag acataagtag gaaacattta tgcacccata 
267 01 tttgatgcta atttatggtc ttgttcaggc ttgggaagga aaggcttatg actattgcat 
2 67 61 ggaaaatcta aagagcatgg gtttccctgt tgatggactt gcttttgatc cggaactggt 
26821 gatatgttta ttttcttgtg aaagttgaaa ggttccacat tgtagtaccc tatctagttt 
26881 ttagagtact caaactatgt tttctcttat tataggttat caggggtctc atgattgaca 
26941 aagagaaagg taatttagtt aaggccgata gatttgggta tgtgaagaga gccatgcacg 
27 001 gtacaaagat gttatcaaat aaagctgtca ggtatatttg acttttcaac accatCggta 
27061 atgtgaagaa aaaagagagt tgcaaagtca ttgttctgtfc ttgaaggaag tctcattctc 
27121 ttattgtctt tgtatggatg aaagtgagat ctatggaagg gagttagttg acctgcggaa 
27181 ccagagtcga tgggagttfcc tcaatacatt tttttcagtt tcagaggctc tggcttatgc 
27241 acaggtacgt tcttttatta caagaaatgg tagctgaaca atttgcagtt gattttggac 
273 01 ataatttact ttatttattt atttgtaact catataattt tcacattcgt tttctcaact 
27361 gatgactctt caccccaatg aactttcttt tttacaaaat tttccgcacg gcttctgcta 
27421 agcaatgaaa gttcaaatca tttcagatgg ttgatagatt ggatgatgga tttatttcgg 
27481 cagatcttgg cactcttgat tataaaggac tgtataaggt tgtgtttcat agaacctaaa 
27 541 tttcattgaa ctgaaatctt tgcatatata ttttttaatc taatcatttt ttccaggctg 
27 601 ttgcaaaagc tctcttcaga gcacatgttg aaggacaact taaggttgat agtttgtcca 
27661 ttccataatc attttcttaa atatcagttt tctccggtcc ctcttgaata atagttactt 
27721 tgcagagtga gataatgtcc aagccggaac tatttgtcga gccagaccca gaactacctt 
27781 tagctctttt agatcaaaag gaggtaatgg ttcagtcttt acacaatata tctaactatq 
27 841 atggtttcag ttgttaatac tgtcatgttc gttcataggc tggtaagaag ctcttgctta 
27901 tcacaaactn ggattatcac tacacagaca aaatgatgaa gcattcattt aacaaattcc 
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27961 ttcccaatga catggactgg cgagatcttt ttgacatggt gagcaaacgt aaatttgttt 
28021 ctgcafctttg attgtaaatc tgttgttttt cagagagtag tttgcattag actttactgt 

28081 ttttgaattc atgtttccac ataggtgata gtttctgcga ggaaaccaga gttcttccag 
28141 atgtcgcacc ctctatatga ggttgtgact ggagagggtt tgatgcgtcc atgcttcaag 
28201 gctgaaacag gtaatttctc attgtttcta ctgaatttaa gctgataact ttacggtgga 

2 8261 gaaagctaaa attttcttcc tctcactgtt tgtctcctat aggaggtttg tactcaggag 
28321 gaagtgctca aatgatagag agttcactca acgttcatgg agatgagatt ttgtatgttg 
28381 gtgaccacat ctacactgat gtcagcgtat ccaaagtcca tctcaggtgg cgaactgcgc 
2B441 tgatttgccg tgaactggaa gaagaggttt ggataaatta tactctgaac tatatcaata 
28501 gatgacagaa cttgaagggt tttcgttcgt tgaagtaaca taaaaaccct gattttctct 
28561 tgtttctttg cagtatatgg ctctaattgg cagtcgtggt caccgagaag agctaataga 
28621 gcttataaat caaaaagagg ttgttgggga tctctttaac caacttcggc ttgctcttca 
28 681 aagacgaagc aaaggccgtc ctgctcaggt ttgtttatct ccttacaagc tctagagtct 
28741 atcttctctc cctgtgtgga acacaatgtg tatgaacaga taaatagata gagatttgtg 
28801 acggaaattg cttataaaga tatccgtttc tcttttcctt ctagactctc gctgctacca 
28861 acttggatga tcaagaactg acagagacca tgcaaaagct tcttattgta atgcaaagac 
28921 tagatgacaa gattggtcta atgctggaaa cagatggcga gctctttaac aaaaggtaaa 
28981 aagaaaaaga caaacaaatg tagtaagatt actcaagatt cacagaaagg gatatgtctg 
29041 agccataatt cttgggctgt tttgcaggtg gggcttcctc tcacgcgcgg gtttgtggga 
29101 taaaagccac ttgatgagac aaatcgaaaa gtatttatgc ctcttctctg catttccgag 
29161 taactcataa gctcaaacca agctcacatt taaactacac ttactttctt caaaattggt 
29221 ttcaggtatg cggatatata cacatcaaga gtctccaact tcctcaacta cacacccttc 
29281 atatatittcG gctcacaaga gcaggtaaaa agcttaaatc agattatcag tatagataca 
29341 aacgaaatct gaggcagatt ctaactaaac tcactcatgt tttttctctc ctctgcagtc 
29401 actggctcac gattctccgc ttccagatgc gggtatagaa aactagcggt tgtgaccaaa 
29461 gttgaggcaa cttgtgtgtt gtaatatttt tagatgaaaa aactgtggat agaagaaatg 
29521 taacaataga aattcttaag ctcaaaagtc tactttatgt ataaaatgtg tattcttatt 
29581 ctctgaattg aagttaaagc tggccttggt tggtatgttg ctgttgtttt tatttactgt 
29641 cttatctttt ttaagctttt tcaggctgac atcagaggca gattgtgcaa atgactactt 
29701 tfcagagaaac attatacttt gagattttgt gaattctgat gactgttata tcaaccacta 
29761 gacactagac agtgcagact tctatataaa caatcgattc gttgtgagca cacaattcat 
29821 tacttcaatt gcaagagaat taataagaaa tctctcaatc tctttattat tcagtatctc 
29881 gatctttcgc agataacaaa tttgtsgatat tactatacga aacatctaag acctatgatg 
29941 cagaaaacgc accgttttcc ttacgtcaaa gtCtgaaaca tacggactct tctctgtcgt 
30001 tcttattcgc tcttctggat tttgtttctc caagcaaaaa aaactccgac tttcaacgaa 
30061 aactcagtgc ctcgtccctc tctttcaaaa aggtactctt tttttatcac ccagcttagt 
30121 ttctagggtt caattttgga tcttagggtt ccgtaatttt aagattctcg aaacatefctt 

3 0181 ttgccttgcg ttagctgagg attagaatcc caatttgagt tggataccta aaagtcagat 
30241 tcttttttgt tgtaaatttg aaggtttttg gttatggcga cgaagaaagt gatagctata 
30301 tgtcagtcag gaggagagtt tgtaactaat aaagatggct cattagctta ttctggtggt 
303 61 gatgcttatg ccatagacat cgatcaggac acttgtatga gcgatttcaa atccgaatta 
30421 gctgaaaatt tcgggtttag ttgggagaat atgacactca agtacttcct ccctggaaac 
30481 aagaaaacct taattactat ctctaaggat aaagatttcc aacgcatggt tagcttttcc 
30541 gcagacgcac caaatgccga aatcttcgtc ttacccgaag aagccgaagc aaggaatgtc 
3 0601 tccaacatgc cagctagtag gtgtgtgttc tagttctaag acctaacggt ttatcgacat 
3 0 661 tgtttatgat tgtaatgact gatgtgtgga tttctgtatt tgacagatca agtaggacaa 
30721 ctgcatcaga agcagtggta cctgtagtgt ctgcaggaga tattgttgtg ggagatgata 
3 07 81 tgatagcaaa tgattttcag attgaaatgg gtataatgcc tgaagaaagc agccctttgc 
3 0841 cgtgtaattt tgttttgact gatgagaaac aacatataaa agctgcacag cagtgggaaa 
30901 acgctattac gggtgttgat caaaggttca atagctttac tgagttccga gatgcgttgc 
3 0961 acaagtactc gatcgcacat gggtttactt acaagtacaa gaagaacgat agccatcgtg 
31021 tttcagtcaa atgcaaagct caaggttgtc catggcgtat cactgcatcg aggctttcga 
31081 ctacgcagct gatttgcatt aagaaaatga atacgaggca cacgtgtgag agagcggttg 
31141 tgaaagctgg ttaccgtgca agtagaggtt gggttggaag tatcataaag gagaagttga 
31201 aagcttttcc agattacaaa ccgaaggata ttgcagagga tatcaaaaga gagtatggga 
31261 ttcaactgaa ttactcgcag gcatggagag ctaaagagat tgctagagag cagcttcaag 
31321 gttcttataa aaaggcatat agtcaacttc cttctttctg caagaagatc agggagacaa 
31381 atcctggaag tattgccatt ttcatgacaa aagaagattc gagtttccat cggcttttca 
31441 tatcgttcta tgcatcaata tctgggttta gacaaggttg tcgtcctctc cttttccttg 

■ 31501 acaccgccga cttaaactct aagtaccaag gggttatgct tgttgctaca gctcctgatg 
31561 ccgaagatgg aatatttcca gtagcttttg cagttgtgga tgctgagaca gaagataact 
31621 gggtttggtt tttggagcac ctcaaattgg cactggccga tccccggaca atcacatttg 
31681 ttgcagattt tcaaaacggt ctgaagactg cattacctct ggtatttgaa aagcaacacc 
31741 atcatgccta ttgtctgcgc cacctcgctg agaaattaaa catggacttg caggctcagt 
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318 01 tttctcatga agcaagacgg tttatactca atgatttcta tgctgctgct tatgcgacac 
318 61 aacctgatgc atatta^ccgt. fcccfctggaaa atataaaaag catttcccct gatgcttata 
31921 cttgggttat cgaaagtgag cctctccatt gggcaaatgc tttgtttgaa ggagagagat 
31981 ataaccacat gaactccatt tttgggctag atttctacag ctgggtttct gaagcacatg 
32041 agttgcccat aactcaaatg atagacgaac ttcgagcaaa gttgatgcaa tcaatatata 
32101 cccatcaggt acaatcccgg gaatggattg tttcaacgct aacaccaacc aacgaagaga 
32151 agctacagaa agaaatagaa cttgcaaggt cacttcaggt ctcagcgcct cacaatagct 
322 21 tattcgaggt tcatggtgaa accatcaact. tagttgacat caatcagtgt gattgcgact 

322 81 gtaaggtatg gagactaacg ggtttaccgt gtagccacgc agttgcagtg gttgaatgca 

323 41 tagggaaaag cccfctatgaa tactgctcca gatactttac ctcagagagt taccgtttaa 

324 01 cgtatgctga gtccattaac cctgtcccta acacaacaat gacgatgatg attttggagg 
32461 agccgcccgt tgaaggggtt gtttcggtta ctccaccgcc cacaagacta acaccaccag 
32521 ggaggccaaa gagtaagcag gtggaaccgc tggacatgtt caagcgtcag ctccagtgta 

325 81 gcaactgcaa gggtttagga cacaacaaga agacctgcaa agctgtatct taagtattag 
32641 ggagagagag agaaatagag aggagagaca tggttaggac aaatttgtag ttatatagct 
327 01 gactttgccc aacattttgg ttttggttct tctgtttgtt gggtttttga gaatgatctt 

327 61 aaagggtaaa aatcfcccttc aggttgaaac ccaaacgtga agagagcaaa gcggctcaga 
32B21 afcctctgfctc ctaaatggca ttaaacttga tgaaagattt tacataaatt tttttgtatg 

328 31 taatttgatt tttaacaaaa aatttcaatg aatcttttca attacatata ttttgtattt 
32941 cgagctttac aaaatccacc acacttgtta aaaagcttat gcagttgccg acaacacttt 
33001 gacgtttaca ttccaaaaat gatcgatgtt catatgagcc tataataaat gctagagaca 
330 61 gtaagatata agtatcaaaa gagtaaaatc tctaccaaga aattgactcc aaccaatcgg 
33121 aggtaaactc ataggccaca aaggttacag caccagcagg agcagctttc acagtcgatg 
33181 gcacaatgcc tttgtacaga ccaCgccacc cttctgatat cataatctgt ctaagaccgt 
33241 ctiaacatgtt tctgtatgca cgtcgctcca ctcgggctcc atatcttgga tgtctctgta 
333 01 aaccttcaat ctgcaacaaa agaaactttc aaaatctatc tggcatttct attatttcat 

333 61 gcttgaaggc aattgcattt tcattgcgtt ggagatccac tggcaagttt gaaaagcttt 
33421 tatatttaag aaatgtttca cacaaattgg agttttcttt ctataggtac ggtgacagaa 

334 81 caagtggggt acccggaatc agfcaagtaga ctcagcaagc acatatcata gggcagtaaa 
33541 tgtgaaagtc tagtctaaga tacataagga tagacccaaa acatgaaacc acatgaatac 

336 01 gataagtcat tgcgctagtt cagtgtagga gcattgtaat taccctagct ggtttgtagt 
33661 agatagtaga tgtgttgaca actcagcaac tatagttgag taagttagca agagagctaa 
33721 gtggctaaaa ggaagtfctgg gatgcgtaat taggtcgagt agagattctc attaatcaag 

337 81 tctctctatg atgtgtagtt cttgctgaag tgaataaata aggagtttaa tagtttggat 
33841 cctatcacat tatcttccca gagaatccat tgggtaaaga ttaactgacc tggaatcgtt 
33 9 01 ttttgaccac atcaagggga tggcagacga gcttggcgct agtgcccgcc ccaagcccac 

339 61 aaatgaaaag ctggaagcta gaaaggttgg tgtccacatt gatcgggatt ttggaggata 

340 21 atttgtatcg attccagtcc tgtttaaaaa taatcaattg agttgattat catatgaaca 
340 81 ttgatcacac ataaactata tattcctatg ctcttaccat catccaacgc ttaaacatat 
34141 catatgtgcc aaattgcagg ccagcgtaag gcacaatctc aacaagagtt ggtgttaatc 
342 01 cattatacaa tcctcttata cctcgagact gaataatatc gacaaaagca gacctcafctg 

342 61 ttggatacac ctgttgacag tcatcagtca gtcatctata tgtcacttaa gcatgaataa 

343 21 ggaagaaaaa caagaaaatc cccagtacat ggaatttaaa ttatattaca ggttcataag 
343 81 tcacgcaact aaccttaggc tccccttgtg aggccaatat tgtccttaaa agatcaaaag 
34441 gataagatcc tagagtagca gcacatcctg ctaacgctcc actgacgaag gatagataag 
345 01 ggctcaaatg tatatgatcc tctgcattgc cataagatca caaccagcta ataagtaagg 

345 51 aaccacattg atctctaaat tatacgagtc aattccaaaa gctattcaaa gaaatacaca 
34621 atfcaacaaac agaaatgata cccctaagat agccaccaga ctacaacaaa aaatagcact 

346 81 ggcaatctac caaataccat atctagagag accgcaatgt tactcagtag actaaattac 

347 41 ctgttttagt agaaccagat gcaaaagatt tcagc;ttat;g i;agtac.tgta aactgaattg 

348 01 aagtatacgg catgaccatg agcaaagctg gcacattacc ccgccaaaac ccctaaagaa 

348 61 aacaagtatc aaacatcaaa acagagaaca agacctcata aaaactaatg ccacaaacct 
34921 gaaactacta acatactata aacaaaaaag aaatggtttc agaaaacgga aatgttcaag 

349 81 agacggaccc gaaaaccttc ttctctaaat afcatctttag tcgcctgtac cattccagta 
35041 tacttagaag ctcctgacaa gtttccccga accaaacccc atgaagtcgt cggttctagc 
35101 tgaacctgca atcacatata acattcttca accatctcta aatatataaa agctaaccat 
35151 tttctaacaa tgataattct catctaaaac ctccacaata caatccaaag caacaacaac 
3 5221 aaactcaaat attctttagc taaaccaaat aaaaatctaa actttatcac tcagaggata 

352 81 gaaaaaaaaa gccaaacctg aaatctaatc ttaatgacat caagaggaga agttacagaa 

353 41 cgagaaacac caccagagat agctccggca gaagcatcaa tcagagcacg tttaatctgt: 
35401 cccggctcat catcaaccgt cgccgtcgta ggaatcatat ttttctaggg tttttggtct 
35461 ctataagcta aaatcaatta aaaaagtttc caaaacccta tcatcaatat aataaacctc 
355 21 aaacaaaata aggaaagaaa ctaaattcaa ggcaacaaat aaaatcaaaa acgaaaactg 
35581 aaatcttttt taccagataa agcgagaaac gaaaagggtc gacattgatg gagtttccgg 
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35641 ggaaaaagca aggaacCttg agatgaccct taaacgtttg ccgcggattt gagagttggg 
35701 tggagattga aaatgatttg actcggaaaa agattataaa aaaaaaaaga agctgagaga 
35761 ttctttctcc gtcaacgttc gtgttctctc cgatagagag attaaagctg gctttcttac 
35821 aaaatttgcc ctttttagga ctcttgtcgg ttcggtttaa tgaagtatca attcgtctta 
35881 aagtaattaa attgtttcaa aggggtttta tggttactct taaaccttct tcctttacgt 
35941 ttgttggatg gatttagtca tttactaatt tgaagatcaa cagagaccct tacaaaaaga 
36001 aaattactcc ttatgttcca atatatfcaga tgttttctgc tttcttcttg tttcatttta 
36061 ttatattaag aaaaaacatt atgtataatt tatgcatatt taatatgttg tgattatttc 
36121 tattctgcaa tcttttttct tttgatteiaa cttttttaaa taaaggtaga taagattttt 
36181 ttagagaaat acactaaaat acatttatgt aaaaataata eiaatagtttc ttattttttt 
36241 taatgatttt agctaaaaca ttttcttatg taaaatatga ttttcatagt tggtaactca 
36301 agtgtttgca acacatcgac atgtgtcaaa tctaattttg ttaatattta aaacaaaaaa 
36361 tatagaaaat ttttaaccct atggaagaat gaaaatacat acaaaaatca caaaaagtta 
36421 tatttctatt cataaatttt taagtaataa gatttagaat aagttaatgg gatgtaacat 
36481 tgctaagagt tagaatactt tctctttatt tttggggtaa agttgtagaa taattaatta 
36541 gttacaaaaa aaagttttta attttttgaa gtaaagttgt agactaatta attatttaca 
36601 aaattttttt tatttttttt ggagtaagtt gtagaataat taattagtca caaaaataag 
36661 tttttaattt ttggagtaaa gttgttgaat aattaattag ctaaaaagat agtatcgttt 
36721 tgtaattttt atatttatct ctaaatattg gtccaacaag gcccaatttt tgattgaatt 
36781 tgtctcaatg gcggattccc aaaggttgtc caccgcttcc ggcgtcaagg acggccaacc 
36841 accatggaag aagaagaagc tgtcgaacga tacgacttcg aatccgtcac ttccatatga 
36901 tgtaatactg atcatcttag cccgcgtctc gagatcgtat tacacaaatc tctccttagt 
36961 cCccaaaagc ttccgatcaa tccttactcc accggagctt tacaaaaccc gaaccctctt 
37021 aggcaaaacc gagaattttc tctatgtttg cttacgcttc cctgatgaag ctaatcctcg 
37081 ctggttcact ctctacagga aacctaatca aaccctaacc gatcacacca ctaagaagaa 
37141 gaagaagaag aagaagaagg aggagaagtc aagtgttaat ctgttggctc caatctcaat 
37201 tctcaattct catcctgtgg aatggtcagc tatcatatct gttgatcatt atctctacgc 
37261 cattagtgca gacatcgaaa aggctccgba ctctaacgtt ccttacctag atfcgtcgaac 
37321 tcacacfctgg aatgaagc!:c caagaatgag attggctcac actaatagtg aaUttgaagg 
37381 aatagtgtat ttaccaggaa gcttfcgaaag tccagattct ttgaactgtg ttgaagtgta 
37441 caacacaatg actcagactt ggaaaccggt gccccccgaa aaacggatgt tcaaattgga 
37501 aaatttggaa aaaaagattt actacaagag ctttcattta gactctaggg ctgggaaagg 
37 561 tctatctttg agctataaat cgaagcattt gacatgtgga ttggtagtat tggatacggt 
37 621 ggattcatat cttcgctctt cttgtatgat agagaatata gcctattttt atcgtaaagg 
37681 aaattttatt tggagaggac ttgacggtaa gatactggtt tacgggaaaa tagagggttt 
37741 agaagggcta Gctaagtttt ctagatattc tagtgttcaa ttggcggaat atggcgggaa 
37801 gctggtggtt ttgtgggaca agtatgtacc tgcgagtggg tataaggaga agatgatttg 
37861 gtgtgcagag atttcgctcg aaaaacgtaa cggtaaagag atttggggga acgttgagtg 
37921 gtttgatgcc gtgcttacag tccctaagtc ttacaaaatc ttgtgtgcta ctgctgctac 
37981 tctttgattc gatgactcgg gaacttgaat ttgtcaagct atttattgtt tattttcttc 
38041 tcagtcgaac atagtcatta caattcagtt aactcatfctt ccattgatat tactccttaa 
381Q1 tttttttcat agttcaaatc aagttagatt tcctacaaat gcacacaaaa cgaaattgtt 
38161 gctttacttg cttaagtgat tcaatgaata ataggttaca gaatatgatt caaaatccac 
3B221 tattgtttta aggcccatta agaatcaatg aaaatgattt tttfctatcat ttctataagg 
3 8281 cccatgcgtt tgtgttcact agcagtgaag tgagagtgtc aaatccctaa attcctccct 
38341 ccctgcgttt gtgttatctg caagtcggag aagaagcaga agaaatcgac taatctctcg 
38401 ccaatgtcgt ctccagagaa gaagaggaag aagaatcaga aatcgtcccc aaatccgtca 
38461 cttccagaag atttgatagt gagcatctta gcccgcgtct cgagatcgta ttacacaaat 
38521 ctctccgtag tctcaaaaac cttccgatca atccttactt caccggagct ttacaaaacc 
38581 cgaacGctct taggcaaacc cgagactttt ctctatgtct gccfcacgctt ccctgatgaa 
3B641 gctaatcctc gttggttgat tctctaccgg aagcctaatc aaaccctaac taagaagaag 
387 01 aagaagaaag aggattcaag tgttcatctt ttggctccaa ttccaattct caattctcca 
38761 actgtggaat ggtcaagtct agtcgctgtt ggttcttatc tgtacgccat taccgcagac 
38821 atcaaagatt ctccatgctc taacgtttgg taccttgact gtcgaacgca cacttggctt 
38881 gattctccaa gattgcgact ggctcacatc aatagtgact ttaatgggag gacgtatttc 
38941 ccaggaagct ctgaaaagcc agattctttg aactgtgttg aagtgtacaa cacaaatact 
39001 cagacttgga atcccgtgcc tccccaaaaa cgaaaactca aattcggaaa catggaagga 
39061 aagatttaca tacccccttg tcaagagaat cataggaagg ctgtggctct aaacccaaag 
39121 gttttgacat gggaagcggt aggactgggt acgaatttgg ataggggttc tttttgtatg 
39181 atagggaaca tagcctattg ttatgaccct agtgggaagt- ttagatggat gaattgtaac 
39241 actgcggaag gagactggag cagattagag ggtttagaag ggctacctaa gtttgctaga 
39301 tatagtactg ttaaattagc ggaatatggc gggaagctgg ttgfctttgtg ggacaagtat 
393 61 gtagctgcaa gtgggtataa ggagaagatg atttggtgtg cagagatttc gcttgaaaag 
39421 cgcaacagtg aagagatttg ggggaaggtc gagtggtttg atgcggttct tacagtccct 
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39481 aagtcttata aattcgtctg tgctaaatct gcfcactgttt gattgatgac tcaggacata 
39541 ggtacgaagc ttcttttatt tagtatgtca cagttgaaag aatgtcttgt atctttttcg 
39501 gaatgcagtt gtcattgttt tcgccttaca aacacctgca ctcacttgct tgtagatgat 
39661 taagcttatt ttgtgactgt ctctttgata attcctttga tgccaaactc tttctcctac 
39721 tgttttttgc tctgaafcgtt gtgttgcaaa atgcctatat ttatgtgact atgatgaatg 
39781 cttctctcat gcttttagcg gatattgttt cacttctaaa catatgcact cacttgtaga 
39841 ctgtttaagc ttactttgtc tatgagaata cttttgtttg tcttgcatag aaaagtactg 
39901 ttatctttac atttcacgat aagagctaga acaagataca cagttagtgc ttgttttgtt 
39961 gtttaaactg aggcttatgt tcatgtcttg tacbfcgtagc caacctaacc tacaaagaaa 
40021 ttgctactgt aacaafcttga ggatcttgtg tgaaaatacg ggtggacttg tcaagccact 
40081 ctttcttttc ttctcaatcg aacactatta ttatccaatt atatactctc tattttttat 
40141 gattcaattc aagttagatt ccattatgca cagaaaatta aattgatcat atgtccaaag 
40201 ctctactgtc atttttgaat tttgttgaaa ttttgaaggg ttaacttatt ttgtgccatt 
40261 gaccactatg atgcatttga aagacttttt ttttgttgtt gaacatacgt gattactagc 
40321 ttcataggaa gaaataagtg taacaaaata tatatatttg acgaattttt ggtcctctac 
40381 tggtaatttt gagttgtgtt gtagaaattt gtggtgactc aaaactattg ttttaaggct 
40441 cattaagatt caatacaaat gaattagata attctataag gcccattaag attcaatgca 
40501 aattatttag ataatttcta aaggcccact aagattcaat gaaaatctgc aagtcggagt 
40561 agaagcagaa gaaatcgact aatctcgcca atgtcgtctc cagagaggaa gaggaagaaa 
40621 cggtcgcttg aaccgtcacc ggaatcgacc ccaaacccgt cacttcccga tgatttgatt 
40681 gtgagcatct tagcccgagt ctctagattg tactacccaa ttctctcgtt agtctccaag 
40741 agctctcgaa ccctcgttac ttcgccggag ctatacaaga c.tcgatcttt cttcaaccga 
40801 acggagagtt gtctctatgt atgcttagat ttccctcctg accctaaccc acgttggttt 
40861 acactctacc gaaaacctaa ccaaaacatc accgagaaga caaagaactc aagcgggttt 
40921 gttttagccc caatcccaaa tcaccattca catagttcga gtatcgttgc tattgggtct 
40981 aacatctatg ccattggtgg atccattgag aatgcaccct cttctaaagt ctcaatttta 
41041 gactgcaggt ctcatacatg gcacgaggca ccaagcatga ggatgaagcg gaattatcca 
41101 gctgctaatg ttgttgatgg gaagatttat gtagcaggag gcttggaaga gtttgattca 
41161 tctaaatgga tggaggtttt cgatataaag actcaaactt gggagtttgt gttaagccct 
41221 ttagcggaga gatttatata fcaggagcctg gtgattgaag gagaaatcta catttttggg 
41281 gataaggttg tgacttacaa gcctaaagaa gatagatggg gaggtgtagg agaacatcag 
41341 agtatggatt tgggattgtt ttttcattct tattgcgtga tagataatgt actgtattgt 
41401 tatcgtcctg gtggaatcaa atggtatgaa tcagagaaaa gatcgtggag gaagttgagg 
41461 ggtttgaagg gattgtctaa gcttgctagt agctgtgtta agttagcgga ttatggtgga 
41521 aagatggctc ttttgtggga caagtatatc ccttgtagtg ggaataagag tcattcgatt 
41581 tcgtgtgcgg tggtttccct tgagagatgc aagaatcagg gtattagggg aaaggttgag 
41641 tggtttgatg atatgcttac ggtacctagc tcttataact ttgtgggtgc tcttgctgct 
41701 actctttgat gatgattgcc atacgaggca cacacatttg attgtctagc tagcttacag 
417 61 ataattgctt tctatgttgt aataacttaa tgatgggatt cttcttacca tattatt.tca 
41821 aaatgcaact ggcttcattt ccttcttcat ttttgttttc ttgtcaagat caaacaaaga 
41881 ttattcagtt atagcgtctt ttgttgtcta gaaatcactg aactagtagt tagttattac 
41941 accttcaccg gttgaaacaa caacttgatg ggacagggaa gtgagaatct tttttgcctt 
42001 ctcggtctta gaaagaatgt acatttgcga tgagagtaag agcaagaatt taaaaaagaa 
42061 caaagaggaa aagccgaaga aattgacatt gtttttgctt ataaacattt gcagtcactt 
42121 gcttgttgat taagttgatt ttgaggttaa ttgacaacta taacaaatca gaagatttaa 
42181 gttgcaatta tgtgcgctaa gctaaggctt ttagactctt atcatcatgt agtaacatca 
42241 agccaagtgc caataccttt atatagatac ttcctgaata caacaaaatt ggacacaatg 
42301 atctgagttt catattatat aatataaatt aactaaaccg atttctcaac cacacatttg 
42361 ctaaaggctc aatataaatt aattaaccct aaatccttct tcttcgacct tttatttggc 
42421 gccaaacccg cagttacatg tttttatttc ccgctcgagc aaattgatct gatccgagtc 
42481 tttcaagcgg ggagagaaag agagagattc gttagtacat tgatctaatg gcgtcgaatt 
42541 cggaagcggg aggatcggcg gattacgaaa cattgatgtc gacatcggac gttgagctat 
42601 tgaagagggc gtggcgtaac ga.gaaggcgg cgccggagat tcttcagtac gaaggagctc 
42661 ttgtcgatag agccaaagaa cagatcgaat tagtggtaaa atgtctaaaa ccctaaatta 
42721 ttctgttgga ttttttccca gaaattaacc actaaatcag aaatcttttg gtctatgatc 
427S1 caaattagac tttccaattg ttagaatagt attcgttttt gtgtgaaatc gaatgccaca 
42841 ttaaacttag ttagagttgt tcattgtctt gtcactttct agtggttgca tttggaaatt 
42901 atcagttttt gaaatgtttg taaataaaca ggaagaaaca attgaagatt atgtggaaaa 
42961 tggcatagat cctctggtgg tgtctctata tcagatggat ttggacagag cccagtttct 
43021 actcagatca tatctcagag ttaggcttct caaggttttt - ttttttttcc aagactttca 
43081 tattagtttt gcgagtttat caacgttgat tccaagtgtc aatgttgctt aattgacttt 
43141 gcttttatgt catctccaat ggtgttcttt tgattgctgc agatagagaa gtttatgttc 
43201 cacaacctca actcagaaga agctgaaagg cgactttctg agcaagagaa agtgtttgct 
43261 accaggtttg ttttcttact tgtctgactt tahgttaatt ggctgtcttc ttccttgtat 
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.is?L gcgttttgtt ctaaatgtta taggtttggt gtagaatctg tgtggaatag atagtgttta 
4"ij81 gagatttgtc tggtactact ctctacatgt ttgagttgtg aatgggafcac aatgaactta 
si 441 catattaata tagttcttgt tcatggagta tttgaactgt cccgagaatt gaaaaacttc 
■ii'jOL aggtagccta ggcacttttt tcatgaatat tttgtttgag tggttttttc agactttaaa 
;'> j61 ctcaattttt ttacaattta cgttagtggc tgctgttgtg agacatatat ttgaatgatc 
4) 021 acaggtgtgc tgatgatttg gcaaagcafct tcgaagagac tgtactgctt aagctgcccg 
-1)681 agaattatca gtcggttctc aagcaatccc ttataagtga agfcggatgat. atgggtacct 
■4 3 741 ccctaaatct tattactttg gaccataatc tcagttggtt aagattgttt gctattatat 
41301 ctgagcattc ccttcacagt agctagagtt gtgtttgtgt tagtgatgac tacacttgtg 
4 58&1 aacttacagt gccgcagccg catttggata cctttgttgt ttgtagaagc aagaactttg 
'ii'ill. tctccctcaa tctatacgaa gagtgagcat tctcttattc ctccatacac atatatatat 
ii'jai acacaatctt tctgtaatcfc aagtgtgggt ttgcctttac ttaccttggc agaggagaga 
4 4041 gccGcgagac tgtggagatg gagcgcggtg atctctactt catacgttac aagattgtaa 
44101 aaagggcaat cgaatctggc caaattgact tgatctgagc tcgccttttt tttcactcaa 
4 4161 atctcttctc atagaataag tttcggatga tgctaccagt attcactctg tatgatcttg 
44 231 tgttacttga gttgtatagt gagtttgagt ttctgttctt gatataaact tagtagttag 
4 4 ? 8 1 tatctttgta gcttcttgca gttgtgccta gcttagctat ccattggctt aaccataacc 
4 4 141 agtggcggag ccagaacaat gactcactgg ggtcaaaaaa aagttaaacg aaacgataaa 
4 4 401 ctttactata aacataacaa tgtcattaca aataaaaact ctaaagatga actctacgtt 
4446 1 cactcatatc ttggaatcgt tttatcacga ttcgtttgta acagtgttaa gtacttgctt 
4 4S2I ctcaataaaa caaacaagac aatcacktaa gaacacatca ctgatacggt tacgcaatgt 
4 4'>ai agtcttcaca aaattcatcg cggaaaaaca tctttcgaca gttgccgttg caacaggaag 
44&41 aatcaaagat agctttagaa gacgatatac caaaggatgt gaaagatgct tttttgtctc 
4 4 7 01 taccatcaca cgagcaagat caccaagaca ctttaaatct gtaaactttt catcgttttt 
4 4 761 cacattatca aggtagatct caagttgatg atcaagagat ctacgctcca caaaactaaa 
44821 gtcatctgga taaaactcga tcaatctcac gagcatggac ttatcaaact aacgaaaaga 
44881 atctatcgga ctcacagaat ctatggtcca taactaaacc acatacttct ttttaaaagg 
44941 gcctgtctgt ttatttttta ttttcttaga tctttttgta attggtccat gatgactgag 
4'!>001 taagacccca aaagccgtaa aaagtaaaaa aagatggaat ctttatagtt taaaaacaaa 
4S061 aggctagtac taaaatatct agttacggtt acgtgtgccg ccatagaaaa acaaatcttc 
4S121 tgctcagtct tggattgaga ggtaagagat ccccaaaact tcgcttttca atctctgaag 
4S181 ctgcttttga attctcatgt tgatttcttt gaatttgcat cgatcaagtt taggtttttt 
4 "3241 ggatttgggt ttctcctaaa tatcttcttt gaccagtgtg tgtgtggttt ggatttatcc 
4S301 ttttggatct ttgtttctcc agaafcaagta cagaaattgt aatctttgtc aaattttgtt 
4*SJ61 tattttgagt ttcttagtga actcagttgg ataacagtga cacatatgag atttgagatg 
«'b421 aatttatttt ctggtaaatg catagagatt attaatgttt cttactgatt tgagaattgc 
4!>481 aaggcattgt cctttgagtt tggtgttaaa gttagcaaat tggtgactat ctctattgag 
4Sb41 tttacatgtt ttcttggttg ttattgtgag gatttaagat ttgcctgctt ttgtatttct 
tSfeOl ctttggtctt gtaatgtcag aattgtttct ttcttaaagt tgatttcttt gtcaatcttt 
4'j(i61 aaacagagat cgaggtgttt ttgacgcaat agaagatcgt tctgcgtgtt attatccaag 
4'>721 aactgcttta gtgttataaa catttgagca gagatgcatt tgtggccatc gttgaaactt 
4'j781 cgaaactct.t tcaagagcac tagccagaag agatatcaaa ggatgaacag tgggaaacaa 
4S841 agtcaaagca atcagcagaa actccttgac ggcgacaaag aatctggaag ccccgagatt 
4'j':*Ql tcgtctggtg gtgggtttgc tgttgtttgt agagatgttg ccatggttct ttcttgctgc 
4*) 961 tactgctgtt tctgctgtgg aggtaaataa accttctcga cacttttccc cttagctagt 
ihOll atctgtgtga tctctggacc aatgttttcg tatgaccUtt gagcattata tgcaagcacc 
46081 atggcacatt ctcatcacat attaacttag ttggtatcag tttagaatct agattaaacc 
4b Ul atgattttta tcttttagca gatatacaat ggttttcctt tagtgattat tcgttcttct 
4 6 201 tcatcaattt ctccgattct gattctgatt cctctgattt gttcagcttg cattgatgag 
4()26i gaggagattt gatgcaatca tgtaagctcc tggtggagga gttaagtgaa agattttagc 
ih'i'll cttagaaaac atcaatctct caaaatgtaa ttagaaatgt tgtttataat taatctgaat 
4b 181 tctttttcca cctttcccaa atcaaattta tatattatga ttgaaattca cgatattgac 
4 6441 tttatcaaat agataagcat ggttaaatga tttcaaatga ttttctattt aacacaaact 
M>bQl aaacttaaac agacacaact tttgttcatg tagaatttta actattcatg gcatttagga 
4b56i tttgaactgt tgaatttccg tttaaaatta aatgtcattt ttaatgcatt attttaaaaa 
4fi621 ttatttacat ttttatgact ttaatttaaa tgtattttaa taagaaaatt tataaaatat 
•!(.681 attaataaat agtaaaaaat aggtttttca tatgtatgtc aaattttaaa attgtttttt 
•:fe741 ttatataaac gatatttatt tataaacgga aggaatataa ttttatgagt tttttttctt 
'ifcHOl tttttttttg tcatccaaaa ttaattaaaa ctcaaacgag ccaatctcga gggatattgt 
4f)861 ttacacatgt gatatgatgc ggttcggtac ggattttccg cgccaaatta ttcgctttta 
4 6'>21 cattagcact tcgcigaaatt aaagataaag aaaaatttgt gagttcttcc ttgtcgcttt 
4 6 981 gcaattcctc caaatacaca gaaaatgccg gccattcggt tggacaattt tatgaattat 
47041 gagttagaaa tttagaatat aattctctat ttaggaattt agaagattgt tagaagagca 
47101 aaagtcaaaa ttactaagtg agagtattta ctcgactcag ccaaccaata ataatccgac 
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47161 acgtcatata aaagttccag aaaggtcaaa aaaaatcaca cacattaaag tgctttgaat 
47221 acaaaagcag actccactat ccaattaaat tacgaaaata ccacttcggt taactccact 

47281 cttttcatct tcttctctgc cgtcgtcgct cggagctcgc gaagctctct catcagaaat 
47341 caaaaaccct aaaaaaagtc ctttctgatt catctataca gacacatact gtattctcaa 
47401 aacaatggag attccttctc tgaataagca gcaagagttc actctcgcct ccgtcactga 
47461 tctcacttct ccttcctcat ctctttcgtc ctctcctgtt gtagctacgt tttcatgcgt 
47521 taatgaagtc aaggaactcc gatttcagga atcgaaatcg tctgatggtt tcagttttga 
47581 tcttagtccg actcaggtgg ctctcgatfcc gatttacaat ttgcacttct tctggttaaa 
47 641 gaatctactt taatctcgtt ttgaaacgtt tctatgcttg gttaagtaat gcactaaaac 
47701 aatctctgtt tagttaggat tattaaatta tacgtgtggt aagtttctgt tctttgagct 
47761 agattctgat tagtgtattt gattttgtgc tgctcagttg ttcaagttgg ggccacttca 
47 821 gttcacttgt gtatctgatg gttcgatttc ttctgcaaag gaggtacgtg aaacatttct 
47 881 tttgttcagt ttcttctgag tgttcatggg attggtttag tgtagttggt tgcaggagat 
47941 gacttgatcg actgcgcatc cttgtttact atatctagtg tgtaatcgga aatgtttttg 
48001 ctgattttgt fcaccagaaat cgtcattttc tcgaggagtt gtaataaaat ttagagatga 
48061 gaaggatagc aaggagtttt gtgattcatt tgaggaatgt aaaaaggatg atgctgttaa 
48121 acaaggtgcg aatcaatgtg ctaattcccc atcctagttg tatcgttcat caaaatccca 
48181 tttcattgat cataatatat tcatgactct tgatcgtact cattctttga tgtctttttc 
48241 gttcttttgt ttctaggatc agccttgccg aacggaacag ttgtttcagc taataaaagc 
48301 aagttcgatg ataagattga ggctgcttca gccaaaatgt atttccatta ttatggacaa 
48361 cfctttacatc agcaaaatat gctacaggafc tatgtgagga cgggtaagat ctggcctttg 
48421 tgcattgatt acctcagagg aaggataact ttttttttgt tgttgcgaac agcaatUcga 
48481 cccaagatag ctctttttcg tcaCattact ctcttcaggt acatatcatg ctgcagtcat 
48541 ggagaaccgt tcagattttt ctggccgtgt tgttgtcgat gtaggcgctg gtagtggaat 
48601 tttgtcaatg tttgctgcac tggtatgcat aaatttaaga tattgtgggc ctagaggtct 
48661 actactacta ttcgtctgca ttttgctttc actttttctg ataaacgtcc attattttct 
48721 tcctaattta ggctggtgcc aagcatgtgt atgctgtgga ggcgtcagaa atggctgaat 
48781 atgcccgtaa actgattgct ggaaacccat tgcttgctga gaggatcaca gtaagacctg 
48841 tagtccagtt tcgttt.gtaa tttttttcat ttatatccat ttttctgatg ttcttcaact 
48901 Ugacacaatt gtatttgtag gtcatcaagg gaaaaattga ggatattgag ttgcctgaga 
48961 aggccgatgt tttgatctct gaaccaatgg gtacgtgttt cagagacgga ttcttctttt 
49021 cacgtgctct tcatgaaaat cagaatcttt cagaaggcaa attttggtct gtgagaaaat 
49081 aaatggagaa acaccattgc tgattcaaag ttatgtatga aagaatatct cttatatctg 
49141 atatacctgt gtctgtaggc accctattgg tcaatgagag gatgctggaa acatatgtta 
49201 ttgctaggga tcgttttctg tctccaaacg gaaaaatgtt tccgacggtt ggaaggtaaa 
492S1 gctgatcttc gat.agtttta cttttctatc tcataaatgt cttctttctc atgtaacttt 
49321 gtcactgtag gatccacatg gcacctttcg ccgatgaatt cttatttgtt gaaatggcaa 
49381 ataaggtaca tattgcacat cttatcagga agatatgttt gagattgctt gtaacataaa 
49441 ataaaatggt tgaattaagt atgggafcggt taggctttca gfcaacttagg cttttctacc 
49501 tcbctatgca ggctttgttt tggcaacaac agaactatta tggagttgat ttgacacctc 
49561 tatatgtatc agcccaccag ggttattttt cccaggtata tgcatttaca accagtgaat 
49 621 cactgatatt tctttgcttg tctcatcctt tacaattttt catatttact gattttgtag 
49681 cccgtggttg atgcatttga tccgagatta ttggtggctc cctcgatgtt tcatgtgata 
49741 gatttcacta tgatgacggt aaagctattt cctacattag attcggttcc acggtgtttc 
49801 cttaaaattg aggacacttt tctgtcatgc tatttctatg tgcagtcgtc cctgtttagt 
49861 gaatggcatg atccaaattc aataactgtc aaattttttt cacaggaaga acaattttat 
49921 gagattgata tcccgttgaa gttcactgcg tctgtgtgta ccagaattca tggacttgcg 
49981 tgctggtttg atgttctctt tgacgggagg tatcaagccg taacaattta tatgttttca 
50041 tcctgtgcta gctaaagatt aatattcctt tttgaccatc aaagcctcaa aggtatatca 
50101 tacaaacctg atgctttgta tccaacagca cggtgcaaag atggtttaca actgctcccg 
50161 gtgcaccaac aacccattgg taccaaataa gatgtgttct ctcgcagcct attcacgtca 
5 0221 tggcagggca agagatcact ggtagacttc atttgattgc ccacagtgct cagagttaca 
50281 ctataaatct aacaccctca ggtcaatttt gctttctgac atctctcctt ttgaatcata 
50341 tcatctactt tggacttatt ttctgattgg ctttttaaac acagctaaaa tgtggggtcc 
5 0401 cggtgccaat caaggtggaa tcctccagac atcatcgtgc aaactcgatc tgaaggaacc 
50461 ctattataga atgtctcagc cacaggttta ccctacacag gaaccaccag ctcagtcgca 
50521 agtaagcgtt tgttattgtc tt.gattaaac aaCcaggata tgatc.gctgt tgtgtttaat 
50581 catttggcgc tttaaagcca ttacattctt tgaatttgca tatttgcaaa agtatatatc 
5 0 641 agcattaaac atatgtggac attctcttgg tgcaggacat acatatacat agtgatgact 
50701 tggaagaact agagttgcta caacaaaacg caaacgctca gctctagaac tagaactaga 
50761 aatagaaccg ttgtttctgt ccatgtattt gtactttgcg tctgcttcct acggtttagc 
50821 accaaccctt tattttcaga cattatcttg cgttttcttg tctagttcga ctctatttca 
50881 ctacgtatca cccacatggt atttgatcct ctacacgcaa atgaatcatt ctattttggg 
50941 cgctaagcca ggttaattga acttctgtac aactatgttc tgacactgca atttttaaaa 
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51G01 ttgatttgga gttgttttca aatctgcaat agggacctag ataattcata actggagttg 
51061 acttcaattt attgagtttia ctgaaacaaa aatgaatcgg agaacgtgca aattgtgcaa 
51121 tggacaaagg aactgtaata gtatagagca aattttaggc taagctgttc acagctgcaa 
51181 ctgcaggctc aggtggatta gcaacaatga cactgaagca gcgtccacag agagtaggat 
51241 ggtcagaaaa tgatccgacc tggcccgagt aattccagca cctctcacac ttagacccct 
51301 cggctcgtga caccccaatc cacactttgt tctctccctc cacatactct cctgtgtgtt 
51361 gcacactgct tatcatctcc ttctccattg aagacaatac ttcaacctgt aatatgcatc 
51421 cacatcagtt tcatggctat agtctcatag aattacactc agcatgatca aatctcaact 
51481 tcaatcgaaa attgtggata ataattgagg aatttgagat gaatcaccat caccaagtat 
51541 aactaaaaac acaatatatt gtaattcctt catacctatt tcttagggat ttaaagggct 
51601 tttcgagagt gacttctata gatctgatca gtacatgtgt ctactgctac aaagtactct 
51661 tcttttaaca tctaaacatt tagcatttct cttggggtcc aaatactata tctaggataa 
51721 agaaccagac tggattaaaa caaattccgg tttgtgccta atcctattca cactaaatct 
51781 tttacttatc tgttttggga agagcctact tgtagtttgg cctgaacttt acacagaaaa 
51841 accacggttc acataaaaac ggccctactt gttgatggtc accatgattc taatttatct 
51901 taagctataa caggaatatg aattttccag ataaggagat ttagtgctta cttgtgatgt 
51961 tatgaatatg cgttgcagag tgtcagcctc gttctttgct tcactcatct ctagtaattt 
52021 tgcagccatg cctgcatctg cagtatgcaa atacaccttt gcttctaaac tggaaccaat 
52081 cattttctca ttacgtgcta gctccagcac tttgttcacc tcagtcctca actgtcacac 
52141 aaacaccata agtatttcta agatcctcat tacaaaaagc gcatcaagaa taaggccagt 
52201 aaaacaaaga ataaacgtaa aagttcttcc agagaaattt cgataatgcc atatcataag 
52261 aactgattac acgatcaact tgacataaca tcgagttgaa tattctgata ttgtgtaaga 
52321 tttcattgag ttcatctcca ctaagtagct aaggaactct attactttaa ttacaagcag 
52381 agtagagagt atacaatggg ctgaattgct gagtttttcc gaattgattt aggagagttt 
52441 tgatacctcg agaagccttt gccagaagag aacatctfcca gcaggaaatg aaagccattg 
52501 tucattcaaa gtaggccatt taagttcgaa gacaaattct gctgcagagc catcttcatt 
52561 tctgtactca aatggaagat tctgccagac atcttccgct aggtgaggta cgattggagc 
52621 aatcacfcctc aatatggata aaagatgtgt tgaaagaact gtttgacagc ttcttctggt 
526B1 aaaacttgaa gtacccctga gaaaaaagat ttttggtcca gactacgttg aatagcacag 
52741 acgtattaag aacaggtcta cataggttct agcaaatatc aaccaaaaga agatgagaaa 
52801 cttacccagt atacagtcta tctfctagcta tatcgaagta gaaatttgac agatcaacaa 
52851 ttgtgaatcg ttgtatgatc tgaaataaag aaagctcatt tagtaacaca gatgatctag 
52921 aacatgaaat taagcctcat gaggttcaaa ggtccattgc aaatacatca acctcactct 
52981 caccttaatg gatagtgatc caattgctga actaaagttt cttagatata ttaaagcttt 
53041 taactgataa attaactcct gtaagagtga tctatgtctg tttcccgtca caggaaaatc 
53101 aaCacatacg tgaatgatgt accaaaataa gatggagagg ttacctggaa tattttaaag 
53161 aactggtagt tctcgtaaca ttcttgtata ttctttacaa cattctcaag ctgaaagaga 
53221 gcatgctgat caatgatagg taaatcctgg tatggaacag cattatcaac ctacaaacca 
53281 acacaagtta aatattaaca gattgtcacc aaccccagaa tataaaacag aagcaagaag 
53341 tcttgacaag tgggaaaaga ggtgaaaaac actactctat gtaaagctta agagttagaa 
53401 aacctactct ccaatcatga agatttccca agaggtatct taacgttccc cttaacttcc 
53461 tatatatatc agacatttga cgaagaatct ggggacctat tagtacatcc cctgtgtaat 
53521 ctacactaga aacccaaaga cgcataacat cagctccgta agcaggtgcg tcctgttgta 
53581 agaagacaaa cccatgccag gaaacagatt aaataacgaa gatagttcta tacaatggga 
53 641 tttacaatgt gacaaacctt tgaafcttttc cctccttcaa tgaccaaacg tgggtccact 
53701 acgttaccta gagacttgct cattttcata cccttctcat ccaatacaaa tccatgtgtt 
53761 ataacagcag aatagggggc ttttcctata tacaatcatg caaaatgtga agaaaaaaat 
53 821 caaaagccac acaaagatag ctaaaaatct gtagaataat ctcaatttat aaggcaatat 
53 8B1 catggcatgc acttctcacc ttgcgttgcg atgcttgtta acaaagaact ttggaaccat 
53941 ccacgatgct gatctgtacc ttctagatac acgtctgcag ggaaactaag accttcgcgt 
54001 ttccctaata caccagccca agaagatcct ggacatacag atgggaaaga aagtgatgta 
54061 aaataggtgt tgggtgctta gtgtgactat ctgattttaa ttagcttctt ttatcatagt 
54121 acctgagtca aaccagacgt ccatcgtatc agtccccttt tcataatcag ctgctttatc 
54181 acgatatttc tcaggaagta ggtcttccac tgacatatac caccacgcat cactaccctt 
54241 ttgggaaatt atagctgcca gacaaggtat tcaaataagg ttacagaaaa cacagccagt 
54301 aaataacaac caataaaaca caccatactc tcttataggc ccttacattt aacatggttg 
54351 attgtctctt cattcatgag tggttctttt gtcttaacat gataaaaagc aggaataggg 
54421 acaccccatg tccttfcgtct tgagatgcac caatcagatc gactggaagt catggcagat 
54481 attctgttta ctgcctgaat gacaattgcc aaaagataac aagtaccaaa attaaactcc 
54541 aaacaatata tatgcatctc ttaaaaaaga taaagtaagg aaagacgcta agfccaagcag 
54601 tacggtatca tcbcaacatt gaaaaaatct cacagcaagg tggatcagat catgtggttc 
54661 atgtgctatg aatctccttt cacagctaac acccacacat gataccttta gcccacataa 
54721 tgaagaaaaa aaaatcaact tataaactca gtatgtattc aactaggtac ctgatgtggt 
54781 acccatt^ta cattgttgat tgcatccatc gtggctgtgc gaaacccttc aaccgatgca 
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841 aaccactgct cagtcgctct aacitatcgtc ggtttcttag ttcgccaatc atatgggtat 
■)4 901 ttatgagctg catgacaaga aggaaattaa atcatgagac tctatagcaa atgtttcctc 
s;S61 aaatggttcc tcagaaagca tttgaatcaa actcaccata agattcttcc atgaccagag 
t0021 acatattttc atccaggtaa ctgacaacag cggtattccc ttctccaagg acagagagcc 
bSOai ctctaaactg tccagcttct tcagtgaact ttccttcatc atccaccgga gagacaagag 
S'jKI gcagcccgta cttcaggcca gttgcataat cctcctgacc atgaccaggg gcagtgtgga 
'jS201 ccaaccctgt tccagattct gtagttatgt aatcaccccc tataacgact ggacagtctc 
'3'j261 tgttgtcaat gggatgagtg tacctacaac catgatgaca caccataaac taaggggttc 
b'>321 aatttatagc attacatgca aaactggaat ttcttttatt ctctaactta aaatgtaacc 
b=>381 tgcagttttc aagatctgaa ccaaggaatg ttttaettat gctaagcttc acaccccatt 
'>^44i tcgcttctaa tgctggcaca aggtcggtcg ctacaatcac gaaaagcttt tgttgattct 
'jt>b01 ttagaacctt cccgggaatt ttctttttat tgcttgtcac tgttgattca tcttctgaga 
Sb'j6i atgactgcac ttccacaaca gagtactgaa gctttgcatt caccgcgaca gctgaaagtt 
">'>621 agacaagaaa gaatagaaga agagtcacta aatgtgctta tatagcattc aaaaaaggta 
'j")681 cgtattattg cttctaagtg gagaagagat atgagaaaga acggatacaa agtaaagtag 
SS'/ll gatgcaattt gacagctgag agaaacacat atactaatat aaccccttac cagcattggc 
S'laoi tggcatagtc caaggtgtag ttgtccafcac tgccaagtat atattaggga taaactcatc 
S'jS&I taaaaggctt gttttcgctc caccaaccaa tttaaaaata gcatatatgc ttttggaaat 
5'j'J21 atgaccctcc ggatactaca cataatggca gatacatgtg ggactgatat cagataacaa 
b'i981 ggtaaagtgt ggaattgtca aaatattatt tcatgaaagc ggcatgcgtt gagaacagca 
^»(i04l atagaaaact acattaaakc agggttcatt ggtgatacat aaagaattag tcttcgacta 
!»HOl atggacaaca gatcgcatca tggtaaaatg tcaacggaag taageiaagag aataggaaga 
!>*16l ctggtatgac tttagcagac ttaagttttg cattgaactg gccgtaatat tattattatt 
•jfe221 cttagattat tgattcaaat ttacctctag ttcagcttct gcaagagcag tccgagatga 
bb2ai agggetccag tgaactggtt tcctacctct ataaatgtac cctttcaagg ccatctggcc 
•j6 3«1 aaatacttca atctacaata tatgataaaa acatatgttg actttatcag gctttgaaga 

56 401 actctaataa tcatctccta cacggttatt tcatccagta aataatgtgg gacgtgagac 
'i«>46l cattacctga gctgcttcat attccggatc aagagttaga taaggattgt tccagtctgc 
5 6 521 ccatactcca aatcgctgta aacaccaaga ataatgtcat atgtgctaca tgtcaatcga 
bfeSSl ataccaaacg aaatatcttt cttaattatt atgtcctgag agatataaga atcattatga 

cggttgcatt caaaacaaga aatcagatat gaaaacaaag cttgatagcc accttaaatg 
b6701 attccatttg tgttttgact gttgcttttg caaatttcgc cgcctttgcc cttaatttta 
S6761 atggtgtaag ttcctttctt acfctcctgat ccagggactg cagaactgaa acgtttgacg 
Sfe821 gttacaccaa attacacagg caagtaagta actggaaaat aacaaaagca cagttacttt 
!s6S81 ctgtttctaa tgactcaaag gtccataacc tattgtgtgt ttacctttca actcaattgg 
J 6 941 aaggccatgg caatcccacc caggaacgta ctgaacttta tagttttgga gcagctgcat 
'jTOOl gaagaagcag gcagtttgat gagaaggaac tccactattg gacgtaagga aaataaagaa 
57061 tccctaagga gagtaaatgg atgtccaagt aggaaagaca aataccttat agcgattaat 
57121 gatatccttc agtatcttgt taagagcatg acccatatgc agatcaccat tggcataagg 
57181 aggaccatca tgaagaatga aactcccctg aaattctaac taactatcag acaccagtct 

57 241 ctcaaagcaa agttggcata agataatttt gagaaactca ctccattgtt attatcagaa 
57 301 actctcttga agacctggtt; ctcttcccac aacttctgga gttcaggttc ccttgtcaaa 
'j7 361 gaattcgctc tcataccaaa accagtcttt ggcaaatcaa ccgtttgctt gtactttcca 
tsV421 tcttcttgct tttcccctaa ccaacacaca acggtaaatg agtgccacat tgcaatatag 
5 7431 tattaagaaa gaagtacaag caatgtgatg caaaccttca gatgctfctct tggctgccat 
5 7 541 aacaggccca cgagaccttc gttttgatga atgaccaaac tcattgtttg gttcagttga 
57 601 aaaccgtgag acatataaga atgaaaaagc cttgacggaa gaccttccct tggcaagaaa 
57661 agaatccaac ggcgtgtttc tccttaaatt tgaacaactt ttacctgaca aaacctaaac 
57 72 1 ggggacaaac tacatctcaa aaactatttc cacgcaaatt ttcagcaaga cacaataact 
577 81 tctatcagat tgatataggt ggtagaagta gtttctccaa acaagttcct gagacaacac 
5'J841 atttttctaa gaaagaaaat agagcataga aacagactta gtttggttaa taagggcttt 

57 901 aagcattaca gaggacacat atgagctcgt catgtgatgc tcagatagtc agaactcaag 
57961 taaactactt gcaatttcat tcttcaaaga tataagcaat ggataaaagt caaccaagaa 
58021 gcctctgaaa ttttatcgaa caactggtgt gcaatagtaa accttaagaa cacaaacaat 
58081 aacagaaagc gaactacttt aacgaaaccc acaaaccgaa attgccaatg cattgataat 
5iil4L aagaaccaat tcgtcgacaa atgtgttcta ggaatcgaac ttgattagaa agactatcac 
S8201 tttccccagc acacaaaaca aagagtcaac agagcagaga tggattcaaa gtaatcaaat 
582 6 1 ttacattcac atactctgta agatgaagat tgaaccatcg ccattgcagc agcttccctg 

58 321 ggatttcccg cgaaggactt gaagaaagac gacatctgct tcaagtttct ctctgtcgct 
58^81 tctatctatc agagctcatt ttcacggcga aacaaagagt gagaaggaag gaagaagact 
58441 aacgtcacca aattcaggga agcgtatg r.g tgtatgtgtg tggataactg aaaagttgaa 
SaSOi actaaaacca aacctcagaa ctcggtttca atttttggat tttatccgtt aaatccctaa 
58b61 atctcttgtt ttttttcttc aagctcataa tagtaaaaga taatcacgtt taaatacata 
58 621 aatatactaa caagtaacaa gaggcgaaat tctttcaaca tgtatggtaa atagattaaa 
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58681 tacaaacaaa aacaaaaaca accattagat aaggcgggtg aatttatagg catatatcat 
58741 cagtaatgca agatataaca gttagactca acagtagcat caccagtctt catatcgaac 
58801 aacttggtcc gtgcttcgac ataacctcta gcaaaccgga acattccact gcctccaacc 
58861 accggcafctt ccctaacctt ggacatcacc gtgfctccgac caagaatcgt tatggtactc 
58921 ccgttgtact tcccagtctt aaaagccaag ttcatcgcca tcaacaaccc tatctccttt 
58981 tgggctgctc caacgtacat cccttgggct tgacccacca cagtcgagtt aatgggcaca 
59041 tctaatgtta aagcgttgtc tatcatggtg attgafcccga aatatgagga acttgagtat 
59101 tttgcaaccg gcccttggat cctgatggaa cttgggtttc gaccagttac aatgtcgtgc 
59161 caatagacac ggagatgtgt gagtttctct ttfcttgccga ggccgaggtg ttttcggttt 
59221 attgttcttg cgaagtcttc accgtttttt ccggctaagg cggttgcggt gaggaggagg 
59281 atttgtgcgg tgaggatgag aatgagcttt gacatgttag ttttctggtg tcgtacttta 
59341 attgtttcat ttgttgttga tagaatgata gtttatgtaa attgtgggtg tatatactgc 
59401 aactgtgcaa cgtgctatat atatcgagta atagatgaga tcacgagtgt gaaaattaaa 
59461 aaggggaaat agataaacga gtgtctacat atagaagaag acgtgctatt tgataaatct 
59521 tctagaaaat aaagtgagtt tggattattt gattttgfcta ggtgaataaa gtgtaaaata 
59581 ttgtatttta tgggggaagt tacaaaagat tcaatcaata tcagtttcta aaagtttgat 
59641 tggcaaagta tttagatgag tatagctagg tgagcatttg taagttatac gattcctcta 
59701 agctttgatfc tcgagtcttt tgttggggtg ttaaattctt aaaaacaaaa acaatattat 
59761 atccttfcttfc taattaaatc tcacgcttta cctttttatt tcatgcaaaa gattgaataa 
59821 ttgaactctc tccactttgg tttgtttatt catataaaac caaacagaaa ataaaaacac 
59881 cttggtcata accttaatag accgatataa tcgggaaagg aaaacaaacc taactattta 
59941 aatgtttctC taacaaaaag tattgaaact gacctgtfctt: agttggaaca atatgcctcg 
60001 cgtgaatatc ggtccagaca acaacgctca aatagagatc actactctac aatcacaaag 
60061 caaatcaaga aagctaaaga aaatcaaaat ggctagtgga agtgagcgtg tggtggttgt 
60121 cggagtagat gattcagccc atagccacca tgcactcgag acggctcttg atctcttttt 
60181 catacctttc aaggccaacc cacgattcaa acttgtcgtc cttcacgctc gaccaacagc 
60241 gaccttcttc, ctcggtgttg ctggcccggg aacggtcgat attataccaa tggtggaaga 
60301 agatttaaac aagaccgcgg acttggttaa gaagaagtgc gccgaagtgt gctcggctaa 
60361 gtccgttgag atctcgtctt tggaggtgat tgaaggagat ccgagaaaca taatgttgga 
60421 agccgtggag agacatcatg catgagttct. tgtccttggt agccatggtt atggagctga 
60481 cgtcaagagg atgtttttgt ggagcgtcaa gtgtgcgtaa ctatttagct cattaagcac 
60541 attcctccgt tctgatcatc aagaagccta aggataaacc tacttctacc aataaacagc 
60601 aatgaggaac aagacgacgt gattttcgta tatgttacca ttctgttttc ttaagtaaga 
60661 ccctatctat cgaacattgt gcatactctt cttccccaac atcttccatt tcaacgttaa 
60721 tagcttagct ctaatgtata tactcattgt atatattcac tatattaatc gcctcttctt 
60781 tgtgttatta atctccttac ttttttctgt atatattgca taaaaatgaa gttcttagaa 
60841 catttttaca tttattctat atgatcataa ttgaaaggtt ccaattaaat ttgttctttg 
60901 gaaacttgta gtaccttgca aagagcttga ccatgtacaa agctgaggta gaaattaaac 
60961 cttatcgtga atgtactata accaatgttg ccgagaactt gtaagcaatg gcttcagtaa 
61021 cttctttttc ttttctcagt ttacctttac cactttgtga gtcttgtttg cataataagg 
61081 tttgatccga gcttgtagaa gacgccactt gtgttttatg cgaaagacat ttaaaactcc 
51141 tataaccaaa taatcatgtt gctgactgct gaggacttgt aaacaatcga ttttagcaac 
S1201 ttttttcttc ttttctcagt ttacctctac cattttatga gtcaagtttg cataacaagg 
61261 tttcaatccg agcttgtgta agataacact tggtgtctta aatataaaaa gatccggcaa 
61321 tgttgatttc cttctaagca atattttcaa tcttcaaact catttataca tagttaacat 
61381 aacctccaaa acataaatat cacagatcca aaagatcgga taaacataat ctctccaaat 
61441 aatgatgcgt tactctcata ctctccagca gttctcaatt ctatctttca tcgtcacaat 
61501 atggaccgtc caatcggtac caccacagcc accggttcga tgcgatcaaa ccggctgcac 
61561 cgtctccaat gcaaaagagt gtcttccaca aattcactga gcccttcgag aaagagagac 
61621 acaatgccac aactaaaaat tgaactgaaa attgtagggc tttaccccaa agggagccct 
61681 tccagaaatt ctcgaaggcc caagacaagc tt 
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RCH1 .2 AB028621+AP002037 (BAB03091) 
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sw!vimavytlvkaqritgkqee!dswe\rtiyqkidfsiddivknitsanvigtgssgvvyrvtipsgetiavkkmwskeenra 
fnseintlgsirhmiirligwcsnrnlkilfydylpngslssiihgagkgsggadwearydwigvahalaylhhdclppiihgdv 
kamnvllgsrfesyladfglakfvsgegvtdgdssklsnrpplagsygymapehasmqhiteksdvysygvvllevitgkh 
pidpdipggahlvqwvrdhlagkkdpreiidprlrgradpimherniqtlavsflcvsnkasdrpmmkdivamlkeirqfdm 
drsesdmikggkcekwqpqplppekivstprgssncsfaysdesv 



RCH1 .4 AL022223 (T05050 or CAA18216) 

mppniyrlsffss!icfffipcfs!dqqgqal!sv^/ksqinisgdafsswhvadtspcnwvgvkcnrrgevseiqlkgmd!qgsl 
pvtslrslksltsttisslnltgvipkelgdftelelldlsdnslsgdipveifrlkklktlsintnnieghipmeignlsglveimlfdnkis 
geiprsigelknlqvlraggnknlrgelpweigncenlvmlglaetslsgklpasignlkrvqtiaiytsllsgpipdeigyctelqn 
!ylyqns!sgsipttigglkklqsl!lwqnnlvgkiptelgncpelwlidfsenlitgtiprsfgklen!qeiq!svnqisgtipeeltnct 

kithieidnniitgeipslmsnjrsitrnffawqnkltgnipqslsqcreiqaidlsynsisgsipkeifglefldlhtnslsgsligttlpk 
sikfidfsdnalsstlppgignteltkinlaknrisgeipreistcrslqnnigendfsgeipdelgqipslaislnlscnrfvgeipsrfs 
d!knigv!dvshnqltgnlnv!tdfqnivs!nisyndfsgdlpntpffrripisdlasnrglyisnaistrpdpttrnsswritiiiKnM 
avlvimavytlvraraagkqligeeidswevtlyqkidfsiddivknitsanvigtgssgvvyritipsgeslavkkmwskeesg 
afnseiktlgsirhrniyrllgwcsnrnlklifydylpngslssrihgagkggcvdwearydvvigvahalaylhhdclptlihgdv 
karnnvligphfepyladfglartisgypntgidlakptnrppmagslwihgssfdfdifcilgftehasmqriteksdvysygw 
llevitgkhpldpdipggahlvkwrdhlaekkdpsrlldpridgrtdsimhemiqtiavaflcvsnkanerplmkdwarnlte 
irhidvgrsetekikaggcgskepqqfmsnekilnshgssncsfafsddsv 
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RCH1 .5 AC015446 (AAG12526) 

merersnffflflfcswvsmaqptislssdgqalislkrpspslTsswdpqdqtpcswygitcsadnras 

ssisslqflnisstnisgpippsfgklthirildlssnslsgpipseigrlstlqfiiinankjsgsipsqisnlfalqviclqdRiingsips 

sfgslvsiqqfrlggntnlggpipaqigflknittlgfaasglsgsipstfgnlvniqtlaiydteisgtippqlglcseirniyihmnkit 

gsipkelgklqkitsinwgnslsgvippeisncsslv'/fdvsanditgdipgdlgkiw/leqiqlsdnmftgqi^ 

IqldknklsgsipsqjgnlkslqsfflwensisgtipssfgnctdlvaidisrnkjtgripeeifsikriskiHlgnslsgglpksvakcq 

s!vr!rvgenqlsgqipkelgelqnivf!dlymnhfsggipyeisnitvlelidvhnnyitgdipaqignlvn!eqld!srnsftgnip! 

sfgnisyjnklllnnnlitgqipksikniqkitildlsynslsgeipqelgqvtsltinldlsyntftgnipetfsdltqiqsidlssnslhgdi 

kvlgsltslaslniscnnfsgpipstpffktisttsyiqntnichsldgitcsshtgqnngvkspkivaitavilasitiailaawilik 

hlyktsqnsssspstaedfsypvvtfipfqkigitvnnivtsitdenvigkgcsgivykaeipngdivavkkiwktkdnneeges 

tidsfaaeiqiignirhrnivWigycsnksvkniynyfpngnlqqnqgnmldwetrykiaigaaqgiaylhhdcvpaiihrdvk 

cnnilldskyeaiiadfgiakimmnspnyhnamsrvaeygytmniteksdvysygwlleilsgrsavepqigdglhivewv 

kkkmgtfepalsvldvklqglpdqivqemlqtlgiamfcvnpspverptmkevvtllmevkcspeewgktsqplikpsss 
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SEQUENCE LISTING 

<110> UWIVERSITEIT UTRECHT 

<12 0> A novel root specific promoter driving the expression of a novel LRR recepto 

r-like 

kinase 

<13 0> CROP-Oll-PCT 

<150> EP 00870288.8 
<151> 2000-12-04 

<150> US 60/256,204 
<151> 2000-12-15 

<160> 19 

<17 0> Patentin version 3.1 

<210> 1 

<211> 7438 

<212> DMA 

<213> Arabidopsis thaliana 

<400> 1 



ctcgagtttc 


agatgtttct 


attaaataag 


taatgaaagc 


agctctaatt 


ctcccaaaat 


60 


tcacaagtat 


agaaagatag aaagatagaa acagagttat 


tgcatggaag 


ctgaaaatca 


120 


tacagcatgg 


tggatataac 


caatgagaca 


caaaacaatc 


acggcatgtg 


ttatagaaca 


180 


acacttatca 


tctgtattaa cataaagcaa aagttcctat 


acgacatagt 


ccaaagttac 


240 


agaccataag 


cacaccttat 


gatttgttgt 


accctctaaa 


acacaggatg 


atgacaagaa 


300 


gcaaatttgc 


ccaaatttac 


tgagggaaaa 


acgagggctt 


aacgctctca 


tatctcaaac 


360 


aaaaacttct 


caaacagttt 


catgtgttca 


gattggaggg 


caatggcaac 


ggataagctg 


42Q 


ccatcattag 


taggacttgg 


tagcacaaaa 


gacaaaccct 


cgta tiggaat 


tccaccaggt 


480 


cccataaaga 


taggacgacc 


ccaaccaaag 


tctgcatcat 


aaataggtaa 


tctaacccag 


540 


cttgtgattc 


ccaaatttgg gcacttgtag gtatgtgcac 


cgcggacaag 


ggctgacaga 


600 


tcaggctgca 


tctccaggta gtcaagagct 


gacctcagat 


agttatcatc 


catacgaacc 


660 


aagaaatcat 


gaatctgtcc 


tgcagcatac 


catgttggct 


tagataacaa 


atcccctgca 


720 


acagccaatg 


gtgttgcagt 


gaatatcaca 


ttcccaaagt 


aaccaggagg 


cagctgcgga 


780 


cgtagtctag 


accttccatc 


agttgcaatg 


tacagtttcg 


tctcttggtc 


gtttggaagc 


840 


cctcgcgcct 


ttcccactga 


tctccacaca 


tgccctgcca 


acatctcgta 


tgagctgtag 


900 


ctgacagtgt 


tcccatcctc 


ctfcggatttc 


gccttaagag 


caacaagctg 


gtctcgtgtt 


960 


aatttgaata 


tagagacagc 


ggtafctctca 


ggtcctgatt 


tagacggatc 


aagaggtatc 


1020 


ttcatacttg 


gtgcaggctg 


atattcaaca 


tgatgaaaag 


caggctgtgg 


cgggtcccta 


1080 


gctcggagga 


gtgttcgatc 


aatgaaaggt 


ggaatggtta 


ggtcaagacc 


acgagccata 


1140 
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tcagaccatg tgttgataaa atgaagacca gagaaaccat ctgccgcgtg atgttgcatc 1200 

ccaaccccaa gtgaagctcc cccacattta aagaaagtca cctatatcat caaacaatat 1250 

cattacataa ctaaaaacag aacatacatc atcaaagaat atcattacag aactaaaaaa 1320 

cagaacaaca caagcattca ataaattaaa aaagggaaag actttaaatt aagctgacct 1380 

gcaaaacgag aagcgggaaa gagtgaatgc cagcggagtg atcaacttcg ggaataagct 1440 

gacggagatt aagggtagga gcaaaatcac caaaatcatc gataacagaa ggagtatcag 1500 

ccacaacgaa gagaacacca gcaccgttac aatcgatctc aatacgacca tcatcgtctc 15 60 

tcttcaagcg accagccata gggtaaaaag ggacaagggc tttggaaaga gcttccttca 1620 

tgacctgagg gtcaaagaaa ttggaagcgc cggtgggtct gtagaagtag acactagggg 1580 

tatggaatct ggggatgaca aggtcgacgt tggagttcca aagattagtg attggtgtct 1740 

cggtggcagg ccggaccatg gtggaatctc tgatgttaat ttfccatattt ttaagaaaga 1800 

tcaaacgaac aaacaaattt tgttttgagt ttgatctgag aaattgtgtg agaagcgagg 1860 

aggggtagat ataagtgggt gtgaacaaaa aaggaaaggt cgtaatcctt cacctaccgc 1920 

cccacgaccc cgcctactcc atctgtcgac caactctctc agaaggttta tcattggtcc 1980 

atctcatcac tcgg-tcctac ttttctatcc ggggttggta aacccgttcc ctctgtccta 2040 

aggcccaaat aatactaatg ggcttaaata gtataaagtt cgaatgaaga agaaggaaaa 2100 

aaaattgaca agttgattgt tatagctacc gtacaagtta gtcaatcgaa tgcc.aaagtt 2160 

tggcttaact taactcatta ccacgtagca gagcataaca acaaccacca ataacatcta 2220 

acggcgtaaa ctcacgtgtt aagagaatta cgtcagaatc caagaaagtc tttttttcag 2280 

tacaccacaa ataaattatg ttgaatttat atccgaacta tactatatta tatgaaaaca 2340 

tacctcaatt agtcataaat ttcacatttc attttaattt attaactaat ggttgttgtt 2400 

gcggatagct tttctttgct ttctgttcta aaattatttc ttagccatta tttgatgatc 2460 

caacttggtt tgaggatcta ataaaaattc aatttctttg aagcattcat tggcttaatg 2520 

atccaaacct tttgtcctcc cactctctaa gaaaataaag caaaagcaaa actaaaagaa 2580 

aagcaactag taccgtgggt tggtcaatgg taacccatgc taccaaatac caatcaaagc 2640 

ttaaatgaca atttagtacc ttgggttggt catgatttag agcggaacaa atataccata 2700 

catcaaacga ggatatacag agaaaattca tggaagtatg gaatttagag gacaatttct 2760 

cttctgggct acaacggacc ggcccattcg ctcatttacc cagaggtatc gagtttgtgg 2820 

acttttgatg ccgctagaga ctattggcat cggattgaaa aaaatgttta cttcgtfcgtt 2880 

aacaattttc tgaatgcaat attttccttg tcatgaatat ttaaacttgt tattactttc 2940 

ttttagctta ggtgtggaca attatggagt ttacttcaaa cgaggaagaa tcttaaacgc 3000 
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tcggttcagg 




aaaccaactc ■ 


acaatcctga cttaattgag i 


gaaaacaatg 


3060 


caaaaccaca 


tgcatgcttc 


cata.tttcta 


tcataatctt ataagaaaaa 


acactactaa 


3120 


gtgaaatgat 


tctgtatata 


tataaccaat 


gccttttgtt ttgtgatatt 


ttatgtatat 


3180 


ataactattg 


acttttgtca 


tctatggata 


gtgtctcggg ctcttggcaa 


acatatttca 


3240 


aagaaaagtt 


aatgactgta 


attaattaat 


ctgaagctag aaacagaacc 


ccgaggtaaa 


3300 


agaaaaagac 


agagcacatg 


aagtttagta 


cttttatata tttaatatat 


cattctttct 


3360 


tattgcttat 


ctctaaagca 


aaaacttccc 


taaaccctaa gccaaaggac 


tcagatcgat 


3420 


gcagaaccaa 


gaaggcttgt 


tttggatttg 


agagccaaat gcaaagaaaa 


aaactcttat 


3480 


gtctfctgcaa 


atgccgattc 


caagaaaaaa 


agccttaact gtttctcatt 


tttccattac 


3540 


attgtctctg 


tttttagctt 


tcttcatctc 


ctcgacctct gcatcaacca 


atgaagtctc 


3600 


agctttgatt 


tcttggcttc 


acagctcgaa 


ctcgccgcca ccgtcagtct 


tctccggctg 


3660 


gaatccttct 


gattctgacc 


cat.gtcaatg 


gccttacatt acttgtt.cct 


cctcagacaa 


3720 


caaactcgtt 


acagagatca 


atgtcgtctc 


tgttcagtta gctttacctt 


tccctcctaa 


3780 


catttcttcg 


tttacttcac 


ttcagaaact 


cgttatctcc aacactaatc 


tcaccggagc 


3840 


tatatcttct 


gagatcggag 


attgttcgga 


gcttatagtt atcgatttaa 


gctcaaatag 


3900 


tcttgttggt 


gaaatacctt 


cgagtctagg 


gaagctcaag aatcttcaag 


aactttgttt 


3960 


aaactccaat 


ggcctcacag 


gaaagatccc 


accggaactc ggcgactgcg 


ttagcctcaa 


4020 


gaatcttgag 


attttcgata 


actacttatc 


cgagaatctc ccgttggagc 


tcggaaagat 


4080 


ctcgactctt 


gagagcataa 


gagccggagg 


aaactcagag ctttcaggga 


agatcccgga 


4140 


ggagatcgga 


aactgtagga 


atctcaaggt 


cttagggtta gcagcaacga 


aaatatccgg 


4200 


ttctttacct 


gfcttcgttgg 


gtcaactaag 


caagctccag tcti: tatctg 


tctattctac 


4260 


aatgctctca 


ggtgagatcc 


ctaaagagct 


tggaaactgc tctgaactta 


tcaatctgtt 


4320 


tctatacgac 


aatgacttgt 


ccggcacgct 


tccaaaagaa ctaggaaagc 


ttcaaaacct 


4380 


agagaagatg 


cttctatggc 


agaataatct 


ccacggacct atccctgagg 


agattggatt 


4440 


catgaaaagc 


ttaaacgcca 


ttgatctctc 


tatgaactat ttctcaggaa 


ccatccctaa 


4500 


atcgttcggt 


aacttgtcga 


atctccaaga 


gcttatgctt agcagcaaca 


acatcaccgg 


4560 


gtcaatacct 


tcgattctaa 


gtaactgcac 


aaagcttgtt cagttccaga 


ttgacgccaa 


4620 


tcagatttca 


ggtttgattc 


caccagagat 


tggattgctt aaggagctca 


acatcttctt 


4680 


gggatggcag 


aataagctag 


aagggaatat 


tccggacgag ttagctggtt 


gtcagaatct 


4740 


tcaagctctt 


gacttgtcac 


agaattatct 


caccggatca ttacctgcag 


gtttgtttca 


4800 


gcttcgtaat 


ctgactaagc 


tcttgcttat 


atctaatgcc atctccggtg 


ttatcccact 


4860 


ggagatcggg 


aactgcactt 


cgcttgtcag 


attaaggctt gtcaataaca 


gaatcaccgg 


4920 
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agaaatccct 


aaaggtattg 


gatttcttca 


gaaccttagc 


ttcctcgact 


tgtctgagaa 


4980 


caatctctcc 


ggtccagttc 


ctttggagafc 


aagtaattgt 


agacaactcc 


agatgcttaa 


5040 


cttgagcaac 


aacactcttc 


aaggttahct 


tcctctgtct 


ttatcatcgt 


taacaaagct 


5100 


tcaagtactt 


gatgtctctt 


ccaatgactt 


gactggcaag atacctgata 


gtctaggtca 


5160 


tctcatttca 


ctcaaccggc 


tcattctaag 


taagaattct 


ttcaatggag 


aaatcccttc 


5220 


gtctctcggt 


cactgcacga 


atcttcagct 


tcttgatctc 


agcagtaata 


acatctctgg 


5280 


aactataccg 


gaggagcttt 


ttgacatcca 


agatctagac 


attgcgttaa 


acttgagctg 


5340 


gaattcatta 


gatggcttta 


tcccggaaag 


gatttccgcg 


cttaaccgat 


tatccgtgct 


5400 


cgatatttcg 


cataacatgc 


tttcaggcga 


cctttctgcg 


ctgtccggtc 


tagaaaactt 


5460 


ggtttctctg 


aacatctctc 


ataacagatt 


ctcaggttat 


cttccagata 


gtaaagtgtt 


5520 


tagacagctg 


ataggagcag 


agatggaagg 


aaacaatgga 


ctctgttcca 


aaggtttcag 


5580 


gtctfcgcttt 


gtaagtaaca 


gttcacagtt 


aactacacag 


cgtggtgtgc 


actcacatag 


5640 


actcaggaCa 


gccattggat 


tgctaatcag cgtgacagcg 


gttctagcgg 


tattaggcgt 


5700 


gttggcggtt 


atacgagcta 


aacaaatgat 


tcgagatgat 


aacgattcag 


agactggaga 


5760 


aaatctatgg 


acatggcaat 


tcacaccttt 


tcagaaactc 


aacttcacag 


tcgaacatgt 


5820 


actcaagtgt 


ttggtagaag 


gtaatgttat 


agggaaaggt 


tgctccggga 


tagtgtacaa 


5880 


agctgaaatg 


cctaacagag 


aagtcatcgc 


ggtgaaaaag 


ctcfcggccag 


tgacagtacc 


5940 


taatctgaat 


gagaaaacta 


agtcatcagg 


agttcgagac 


tcattctcag 


ctgaagtaaa 


6000 


aacacttgga 


tcgatcagac 


acaagaacat 


tgtacgcttc 


ttgggatgtt 


gttggaacaa 


5060 


gaacactaga 


cttcttatgt 


atgattatat gtcaaatggg agtttgggaa gtttgcttca 


6120 


cgaaaggagc 


ggtgtatgta 


gcttaggatg 


ggaagtgagg 


tacaagatta 


tacttggtgc 


6180 


agctcagggt 


ttggcttact 


tgcaccatga 


ctgtgttcct 


cccattgttc 


atagagacat 


6240 


caaggcaaat 


aatattctga 


ttggccctga 


ttttgaacct 


tacattggag 


atttcggact 


6300 


cgctaagctt 


gttgatgatg 


gcgactttgc 


tcgttcttcc 


aacaccattg 


ctggttccta 


6360 


tggttacata 


gctccaggta 


cttcccaaat 


ttctgttttg 


ctgctaatta 


gttaacttga 


6420 


tacatcataa 


gttactagta 


gtcttagcct 


agtaatggfca 


gaaattcagg 


tttatagttt 


6480 


ggttttggtt 


tttggcattt 


tgaaaagaat 


agtattttgg 


ttcagttgat 


tttcgatttg 


6540 


gtttaaaatt 


ttagaaaaaa 


aatggtagaa 


gaaataattt 


ttgattaata 


ttatatcatc 


6600 


tactgtatac 


gatttaaaaa 


aaaaattgat 


ttggtttggt 


ctcgatattt 


tgcttgtagc 


6660 


ttggttcaat 


atttttacta 


tatatatgat 


tcatgacttt 


catactaatc 


aatatttgtt 


6720 


cggtttcgtt 


ttgattagtc 


tgaaagtgaa 


tcatatagtt 


aaaatattga 


ccaatctatc 


6780 
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atatacttac 


acgctacaag 


cropOllpct . ST2b . txt 
attttcgatg taattttggt gtaagcattt 


tggtttttat 


6840 


atgttcatcc 


ttagtaacta 


acaacaaata 


tcaattggga 


agcataacgt 


tatatattcc 


6900 


cataaggcta 


caacactata 


attatgcaga 


atacggatac 


tcaatgaaga 


taacagagaa 


6960 


aagcgacgtg 


tacagctatg 


gagtcgtagt 


gctcgaggta 


ctaacgggta 


agcaaccaat. 


7020 


cgatccaacg 


ataccagatg 


gactccacat 


agtggactgg 


gtcaagaaaa 


tcagagacat 


7080 


acaagtaatc 


gaccaaggac 


tacaagcaag 


accagagtca 


gaggttgaag agatgatgca 


7140 


aacgctagga 


gtcgcgcttt 


tatgcattaa 


tccaataccc 


gaagacaggc 


ctacaatgaa 


7200 


agatgtggct 


gctatgcfcfca 


gtgagatatg 


tcaagagaga 


gaggaatcga 


tgaaagttga 


7260 


tggttgctcg 


ggaagttgta 


acaatggaag 


agaacgtggc 


aaagatgatt 


cgacttcatc 


7320 


ggttatgcaa 


caaacggcta 


agtatttgag 


aagtagtagc 


acgagtttct 


ctgcgtcttc 


7380 


tttgctttac 


tcttcttctt 


cttctgctac 


ttctaatgtt 


agaccaaatc 


ttaaataa 


7438 



<210> 2 

<211> 3499 

<212> DNA 

<213> Arabidopsis thaliana 

<400> 2 



ctcgagtttc 


agatgtttct 


attaaataag 


taatgaaagc 


agctctaatt 


ctcccaaaat 


60 


tcacaagtat 


agaaaga tag 


aaagatagaa 


acagagttat 


tgcatggaag 


ctgaaaatca 


120 


tacagcatgg 


tggatataac 


caatgagaca 


cciaaacaatc 


acggcatgtg 


ttatagaaca 


180 


acacttatca 


tctgtattaa 


cataaagcaa 


aagttcctat 


acgacatagt 


ccaaagttac 


240 


agaccataag 


cacaccttat 


gatttgttgt 


accctctaaa 


acacaggatg 


atgacaagaa 


300 


gcaaatttgc 


ccaaatttac 


tgagggaaaa acgagggctt 


aacgctctca 


tatctcaaac 


360 


aaaaacttct 


caaacagttt 


catgtgttca gattggaggg caatggcaac ggataagctg 


420 


ccatcattag 


taggacttgg 


tagcacaaaa 


gacaaaccct 


cgtatggaat 


tccaccaggt 


480 


cccataaaga 


taggacgacc 


ccaaccaaag 


tctgcatcat 


aaataggtaa 


tctaacccag 


540 


cttgtgafctc 


ccaaatttgg 


gcacttgtag 


gtatgtgcac 


cgcggacaag 


ggctgacaga 


600 


tcaggctgca 


tctccaggta 


gtcaagagct 


gacctcagat 


agttatcatc 


catacgaacc 


660 


aagaaatcat 


gaatctgtcc 


tgcagcatac 


catgttggct 


tagataacaa 


atcccctgca 


720 


acagccaatg 


gtgttgcagt 


gaatatcaca 


ttcccaaagt 


aaccaggagg 


cagcfcgcgga 


780 


cgtagtctag 


accttccatc 


agttgcaatg 


tacagtttcg 


tctcttggtc 


gtttggaagc 


840 


cctcgcgcct 


ttcccactga 


tctccacaca 


tgccctgcca 


acatctcgta 


tgagctgtag 


900 


ctgacagtgt 


tcccatcctc 


cttggatttc 


gccttaagag 


caacaagctg 


gtctcgtgtt 


960 


aatttgaata 


tagagacagt 


ggtattctca 


ggtcctgatt 


tagacggatc 


aagaggtatc 


1020 
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ttcatacttg 


gtgcaggctg 


atattcaaca 


tgatgaaaag caggctgtgg 


cgggtcccta 


1080 


gctcggagga 


gtgttcgatc 


aatgaaaggt 


ggaatggtta ggtcaagacc 


acgagccata 


1140 


tcagaccatg 


tgttgataaa 


atgaagacca 


gagaaaccat ctgccgcgtg 


atgttgcatc 


1200 


ccaaccccaa 


gtgaagctcc 


cccacattta aagaaagtca cctatatcat 


caaacaatat 


1260 


cattacataa 


ctaaaaacag 


aacatacatc 


atcaaagaat atcattacag 


aactaaaaaa 


1320 


cagaacaaca 


caagcattca 


ataaattaaa 


aaagggaaag actttaaatt 


aagctgacct 


1380 


gcaaaacgag 


aagcgggaaa 


gagtgaatgc 


cagcggagtg atcaacttcg ggaataagct 


1440 


gacggagatt 


aagggtagga 


gcaaaatcac 


caaaatcatc gataacagaa 


ggagtatcag 


1500 


ccacaacgaa 


gagaacacca 


gcaccgttac 


aatcgatctc aatacgacca 


tcatcgtctc 


1560 


tcttcaagcg 


accagccata 


gggtaaaaag 


ggacaagggc tttggaaaga 


gcttccttca 


1620 


tgacctgagg 


gtcaaagaaa 


ttggaagcgc 


cggtgggtct gtagaagtag 


acactagggg 


1680 


tatggaatct 


ggggafcgaca 


aggtcgacgt 


tggagttcca aagattagtg 


attggtgtct 


1740 


cggtggcagg 


ccggaccatg 


gtggaatctc 


tgatgttaat tttcatattt 


ttaagaaaga 


1800 


tcaaacgaac 


aaacaaattt 


tgttttgagt 


ttgatctgag aaattgtgtg agaagcgagg 


1860 


aggggtagat 


ataagtgggt 


gtgaacaaaa 


aaggaaaggt cgtaatcctt 


cacctaccgc 


1920 


cccacgaccc 


cgcctactcc 


atctgtcgac 


caactctctc agaaggttta 


tcattggtcc 


1980 


atctcatcac 


tcggtcctac 


ttttctatcc 


ggggttggta aacccgttcc 


ctctgtccta 


2040 


aggcccaaat 


aatactaatg 


ggcttaaata gtataaagtt cgaatgaaga 


agaaggaaaa 


2100 


aaaattgaca 


agttgattgt 


tatagctacc 


gtacaagtta gtcaatcgaa 


tgccaaagtt 


2160 


tggcttaact 


taactcatta 


Gcacgtagca gagcataaca acaaccacca ataacatcta 


2220 


acggcgtaaa 


ctcacgtgtt 


aagagaatta 


cgtcagaatc caagaaagtc 


tttttttcag 


2280 


tacaccacaa 


ataaattatg 


ttgaatttat 


atccgaacta tactatatta 


tatgaaaaca 


2340 


tacctcaatt 


agtcataaat 


ttcacatttc 


attttaattt attaactaat 


ggttgttgtt 


2400 


gcggatagct 


tttctttgct 


ttctgttcta 


aaattatttc ttagccatta 


tttgatgatc 


2460 


caacttggtt 


tgaggatcta 


ataaaaattc 


aatttctttg aagcattcat 


tggcttaatg 


2520 


atccaaacct 


tttgtcctcc 


cactctctaa gaaaataaag caaaagcaaa 


actaaaagaa 


2580 


aagcaactag 


taccgtgggt 


tggtcaatgg 


taacccatgc taccaaatac 


caatcaaagc 


2640 


ttaaatgaca 


atttagtacc 


ttgggttggt 


catgatttag agcggaacaa 


atataccata 


2700 


catcaaacga 


ggatatacag 


agaaaattca 


tggaagtatg gaafcttagag 


gacaatttct 


2760 


cttctgggct 


acaacggacc 


ggcccattcg 


ctcatttacc cagaggtatc 


gagtttgtgg 


2820 


acttttgatg 


ccgctagaga 


ctattggcat 


cggattgaaa aaaatgttta 


cttcgttgtt 


2 88 0 


aacaattttc 


tgaatgcaat 


attttccttg 


tcatgaatat ttaaacttgt 


tattactttc 


2940 
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ttt-t.i: '• I 'jgtgtggaca attatggagt ttacttcaaa cgaggaagaa tcttaaacgc 3000 

tcgni •. • 1 : ; t ctcgaaaac aaaccaactc acaatcctga cttaattgag gaaaacaatg 3060 

caaaac -.j. a igcatgcttc catatttcta tcataatctt ataagaaaaa acactactaa 3120 

gtq.,.-, >■ ) -.'. ictgtatata tataaccaat gccttttgtt ttgtgatatt ttatgtatat 3180 

atua: ' ,.■ ■ ; (icttttgtca tctatggata gtgtctcggg ctcttggcaa acatatttca 3240 

aaqa.s.1,4 : aaCgactgta attaattaat. ctgaagctag aaacagaacc ccgaggtaaa 3300 

aga.'ia:!., ; . - agagcacatg aagttt.agta cttttatata tttaatatat cattcfcttct 3360 

ta--t ; --.i- jtctaaagca aaaacttccc taaaccctaa gccaaaggac tcagatcgat 3420 

gc.'i 1 • . iia oaaggcttgt tttggatttg agagccaaat gcaaagaaaa aaactcttat 3480 

gtc. '. ■ .} «.i atgccgatt 3499 

<21C> 1 

<:;..>- A:«iidopsis thaliana 

at vj aaatgccgat tccaagaaaa aaagccttaa ctgtttctca tttttccatt 60 

acat t jr ., '.c tgtttttagc tttcttcatc tcctcgacct ctgcatcaac caatgaagtc 120 

tcajrrttga tttcttggct tcacagctcg aactcgccgc caccgtcagfc cttctccggc 180 

tg ; a^i- c f. t. ccgattctga cccatgtcaa tggccttaca ttacttgttc ctcctcagac 240 

aacaa.i - -9 ttacagagat caatgtcgtc tctgttcagt tagctttacc tttccctcct 300 

aAct* : cgtttacttc acttcagaaa ctcgttatct ccaacactaa tctcaccgga 360 

gc a' i^' ■ ctgagatcgg agattgttcg gagcttatag tfcatcgattc aagctcaaat 420 

ag", "a* t ,3 gtgaaatacc ttcgagtcta gggaagctca agaatcttca agaactttgt 480 

tt.aat» ••. c-c« atggcctcac aggaaagatc ccaccggaac tcggcgactg cgttagcctc 540 

aa-jrf, -k: c. t g agatttfccga taactactta tccgagaatc tcccgttgga gctcggaaag 600 

at:t. ttgagagcat aagagccgga ggaaactcag agctttcagg gaagatcccg 660 

giiig-tj*; j jaaactgtag gaatctcaag gtcttagggt tagcagcaac gaaaatatcc 720 

gvj'.' :":a: .rtgcttcgtt gggtcaacta agcaagctcc agtctttatc tgtctattct 780 

acj.i' .jri caggtgagat ccctaaagag cttggaaact gctctgaact tatcaatctg 840 

tttciiii acaatgactt gtccggcacg cttccaaaag aactaggaaa gcttcaaaac 900 

ctc-i 3.i ! j.i Lgcttctatg gcagaataat ctccacggac ctatccctga ggagattgga 960 

ttc.-s' : s^i.i s -Tcttaaacgc cattgatctc tctatgaact atttctcagg aaccatccct. 1020 

aaa' I- 1 .- ? -taacttgtc gaatctccaa gagcttatgc ttagcagcaa caacatcacc 1080 
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gggtcaatac 


cttcgattct 


aagtaactgc 


acaaagcttg 


ttcagttcca 


gattgacgcc 


1140 


aatcagattt 


caggtttgat 


tccaccagag 


attggattgc 


ttaaggagct 


caacatcttc 


1200 


ttgggatggc 


agaataagct 


agaagggaat 


attccggacg 


agttagctgg 


ttgtcagaat 


1260 


cttcaagctc 


ttgacttgtc 


acagaattat 


ctcaccggat 


cattacctgc 


aggtttgttt 


1320 


cagcttcgta 


atctgactaa 


gctcttgctt 


atatctaatg 


ccatctccgg 


tgttatccca 


1380 


ctggagatcg 


ggaactgcac 


ttcgcttgtc 


agattaaggc 


ttgtcaataa 


cagaatcacc 


1440 


ggagaaatcc 


ctaaaggtat 


tggatttctt 


cagaacctta 


gcttcctcga 


cttgtctgag 


1500 


aacaatctct 


ccggtccagfc 


tcctttggag 


ataagtaatt 


gtagacaact 


ccagatgctt 


1560 


aacttgagca 


acaacactct 


tcaaggttat 


cttcctctgt 


ctttatcatc 


gttaacaaag 


1620 


cttcaagtac 


ttgatgtctc 


ttccaatgac 


ttgactggca 


agatacctga 


tagtctaggt 


1680 


catctcattt 


cactcaaccg 


gctcattcta 


agtaagaatt 


ctttcaatgg agaaatccct 


1740 


tcgtctctcg 


gtcactgcac 


gaatcttcag 


cttcttgatc 


tcagcagtaa 


taacatctct 


1800 


ggaactatac 


cggaggagct 


ttttgacatc 


caagatctag 


acattgcgtt 


aaacttgagc 


1860 


tggaattcat 


tagatggctt 


tatcccggaa 


aggatttccg 


cgcttaaccg 


attatccgtg 


1920 


ctcgatattt 


cgcataacat 


gctttcaggc gacctttctg cgctgtccgg 


tctagaaaac 


1980 


ttggtttctc 


tgaacatctc 


tcafcaacaga 


ttctcaggtt 


atcttccaga 


tagtaaagtg 


2040 


t;t:t:agacagc 


tgataggagc 


agagatggaa 


ggaaacaatg 


gactctgttc 


caaaggtttc 


2100 


aggtcttgct 


ttgtaagtaa 


cagttcacag 


ttaactacac 


agcgtggtgt 


gcactcacat 


21S0 


agactcagga 


tagccattgg 


attgctaatc 


agcgtgacag 


cggttctagc 


ggtattaggc 


2220 


gtgttggcgg 


ttatacgagc 


taaacaaatg 


attcgagatg 


ataacgattc 


agagactgga 


2280 


gaaaatctat 


ggacatggca 


attcacacct 


tttcagaaac 


tcaacttcac 


agtcgaacat 


2340 


gtactcaagt 


gtttggtaga 


aggtaatgtt 


atagggaaag 


gttgctccgg 


gatagtgtac 


2400 


aaagctgaaa 


tgcctaacag 


agaagtcatc 


gcggtgaaaa 


agctctggcc 


agtgacagta 


2460 


cctaatctga 


atgagaaaac 


taagtcatca 


ggagttcgag 


actcattctc 


agctgaagta 


2520 


aaaacacttg 


gatcgatcag 


acacaagaac 


attgtacgct 


tcttgggatg 


ttgttggaac 


2580 


aagaacacta 


gacttcttat 


gtatgattat 


atgfccaaatg 


ggagtfctggg 


aagtttgctt 


2640 


cacgaaagga 


gcggtgtatg 


tagcfcagga 


t;,gggaa,gtga 


ggtacaagat 


tatacttggt 


2700 


gcagctcagg 


gtttggctta 


cttgcaccat 


gactgtgttc 


ctcccattgt 


tcatagagac 


2760 


atcaaggcaa 


ataatattct 


gattggccct 


gattttgaac 


cttacattgg 


agatttcgga 


2820 


ctcgctaagc ttgttgatga tggcgacttt gctcgttctt ccaacaccat 


tgctggttcc 


2880 


tatggttaca 


tagctccagg 


tacttcccaa 


atttctgttt 


tgctgctaat 


tagttaactt 


2940 
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gatacatcat 


aagttactag 


tagtcttagc 


ctagtaatgg 


tagaaattca 


ggtttatagt 


3000 


ttggttttgg 


tttttggcat 


tttgaaaaga 


atagtatttt 


ggttcagttg 


attttcgatt 


3060 


tggtttaaaa 


ttttagaaaa 


aaaatggtag 


aagaaataafc 


ttttgattaa 


tattatatca 


312 0 


tctactgtat; 


acgatttaaa 


aaaaaaattg atttggtttg gtctcgatat 


tttgcttgta 


3180 


gcttggttca 


atatttttac 


tatatatatg 


attcatgact 


ttcatactaa 


tcaatatttg 


3240 


ttcggtttcg 


ttttgattag 


fcctgaaagtg 


aatcatatag 


ttaaaatatt 


gaccaatcta 


3300 


tcatatactt 


acacgctaca 


agattttcga 


tgtaattttg 


gtgtaagcat 


tttggttttt 


3360 


atatgttcat 


ccttagtaac 


taacaacaaa 


tatcaattgg 


gaagcataac 


gttatatatt 


3420 


cccataaggc 


tataacacta 


taattatgca 


gaatacggat 


actcaatgaa 


gataacagag 


3480 


aaaagcgacg 


tgtacagcta 


tggagtcgta 


gtgctcgagg 


tactaacggg 


taagcaacca 


3540 


atcgatccaa 


cgataccaga 


tggactccac 


atagtggact 


gggtcaagaa 


aatcagagac 


3600 


atacaagtaa 


tcgaccaagg 


actacaagca 


agaccagagt 


cagaggfctga 


agagatgatg 


3660 


caaacgctag 


gagtcgcgct 


tttatgcatt 


aatccaatac 


ccgaagacag 


gcctacaatg 


3720 


aaagatgtgg ctgctatgct 


tagtgagata 


tgtcaagaga 


gagaggaatc 


gatgaaagtt 


3780 


gatggttgct 


cgggaagttg 


fcaacaatgga 


agagaacgtg gcaaagatga 


ttcgacttca 


3840 


tcggttatgc 


aacaaacggc 


taagtatttg agaagtagta 


gcacgagttt 


ctctgcgtct 


3900 


tctttgcttt 


actcttcttc 


ttcttctgct 


acttctaatg 


ttagaccaaa 


tcttaaataa 


3960 



<210> 4 
<211> 1135 
<212> PRT 

<213> Arabidopsis thaliana 

<400> 4 

Met Ser Leu Gin Met Pro He Pro Arg Lys Lys Ala Leu Thr Val Ser 
15 10 15 

His Phe Ser He Thr Leu Ser Leu Phe Leu Ala Phe Phe He Ser Ser 
20 25 30 

Thr Ser Ala Ser Thr Asn Glu Val Ser Ala Leu He Ser Trp Leu His 
35 40 45 

Ser Ser Asn Ser Pro Pro Pro Ser Val Phe Ser Gly Trp Asn Pro Ser 
50 55 60 

Asp Ser Asp Pro Cys Gin Trp Pro Tyr He Thr Cys Ser Ser Pro Asp 
65 70 75 80 



Asn Lys Leu Val Thr Glu He Asn Val Val Ser Val Gin Leu Ala Leu 
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Pro Phe Pro Pro Asn lie Ser Ser Phe Thr Ser Leu Gin Lys Leu Val 
100 105 110 



lie Ser Asn Thr Asn Leu Thr Gly Ala lie Ser Ser Glu He Gly Asp 
115 120 125 



Cys Ser Glu Leu He Val He Asp Leu Ser Ser Asn Ser Leu Val Gly 
130 135 140 

Glu He Pro Ser Ser Leu Gly Lys Leu Lys Asn Leu Gin Glu Leu Cys 
145 150 155 160 

Leu Asn Ser Asn Gly Leu Thr Gly Lys He Pro Pro Glu Leu Gly Asp 
165 170 175 



Cys Val Ser Leu Lys Asn Leu Glu He Phe Asp Asn Tyr Leu Ser Glu 
180 185 190 

Asn Leu Pro Leu Glu Leu Gly Lys He Ser Thr Leu Glu Ser He Arg 
195 200 205 

Ala Gly Gly Asn Ser Glu Leu Ser Gly Lys He Pro Glu Glu He Gly 

210 215 220 



Asn Cys Arg Asn Leu Lys Val Leu Gly Leu Ala Ala Thr Lys He Ser 
225 230 235 240 

Gly Ser Leu Pro Val Ser Leu Gly Gin Leu Ser Lys Leu Gin Ser Leu 
245 250 255 

Phe Val Tyr Ser Thr Met Leu Ser Gly Glu He Pro Lys Glu Leu Gly 
260 265 270 



Asn Cys Ser Glu Leu He Asn Leu Phe Leu Tyr Asp Asn Asp Leu Ser 
275 280 285 

Gly Thr Leu Pro Lys Glu Leu Gly Lys Leu Gin Asn Leu Glu Lys Met 
290 295 300 



Leu Leu Trp Gin Asn Asn Leu His Gly Pro He Pro Glu Glu He Gly 
305 310 315 320 



Phe Met Lys Ser Leu Asn Ala He Asp Leu Ser Met Asn Tyr Phe Ser 
325 330 335 
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Gly Thr lie Pro Lys Ser Phe Gly Asn Leu Ser Asn Leu Gin Glu Leu 
340 345 350 



Met Leu Ser Ser Asn Asn lie Thr Gly Ser lie Pro Ser lie Leu Ser 
355 360 365 



Asp Cys Thr Lys Leu Val Gin Phe Gin lie Asp Ala Asn Gin lie Ser 
370 375 380 

Gly Leu lie Pro Pro Glu lie Gly Leu Leu Lys Glu Leu Asn lie Phe 
385 390 395 400 

Leu Gly Txp Gin Asn Lys Leu Glu Gly Asn lie Pro Asp Glu Leu Ala 
405 410 415 

Gly Cys Gin Asn Leu Gin Ala Leu Asp Leu Ser Gin Asn Tyr Leu Thr 

420 425 430 



Leu Leu lie Ser Asn Ala lie Ser Gly Val lie Pro Leu Glu Thr Gly 

450 455 460 

Asn Cys Thr Ser Leu Val Arg Leu Arg Leu Val Asn Asn Arg lie Thr 

465 470 475 480 

Gly Glu lie Pro Lys Gly lie Gly Phe Leu Gin Asn Leu Ser Phe Leu 

485 490 495 



Asp Leu Ser Glu Asn Asn Leu Ser Gly Pro Val Pro Leu Glu lie Ser 
500 505 510 

Asn Cys Arg Gin Leu Gin Met Leu Asn Leu Ser Asn Asn Thr Leu Gin 

515 520 525 

Gly Tyr Leu Pro Leu Ser Leu Ser Ser Leu Thr Lys Leu Gin Val Leu 
530 535 540 

Asp Val Ser Ser Asn Asp Leu Thr Gly Lys lie Pro Asp Ser Leu Gly 
545 550 555 560 

His Leu lie Ser Leu Asn Arg Leu lie Leu Ser Lys Asn Ser Phe Asn 
565 570 575 

Gly Glu He Pro Ser Ser Leu Glv His Cvs Thr Asn Leu Gin Leu Leu 
580 585 590 
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Asp Leu Ser Ser Asn Asn lie Ser Gly Thr He Pro Glu Glu Leu Phe 
595 600 605 



Asp He Gin Asp Leu Asp He Ala Leu Asn Leu Ser Trp Asn Ser Leu 
610 615 620 



Asp Gly Phe He Pro Glu Axg He Ser Ala Leu Asn Arg Leu Ser Val 
625 630 635 640 

Leu Asp He Ser His Asn Met Leu Ser Gly Asp Leu Ser Ala Leu Ser 

645 550 655 



Gly Leu Glu Asn uexi Vai Ser Leu Asn He Ser His Asn Arg Phe Ser 
560 665 670 

Gly Tyr Leu Pro Asp Ser Lys Val Phe Arg Gin Leu He Gly Ala Glu 
675 680 685 



Met Glu Gly Asn Asn Gly Leu Cys Ser Lys Gly Phe Arg Ser Cys Phe 
690 695 700 

Val Ser Asn Ser Ser Gin Leu Thr Thr Gin Arg Gly Val His Ser His 
705 710 715 720 

Arg Leu Arg He Ala He Gly Leu Leu He Ser Val Thr Ala Val Leu 
725 730 735 



Ala Val Leu Gly Val Leu Ala Val He Arg Ala Lys Gin Met He Arg 
740 745 750 

Asp Asp Asn Asp Ser Glu Thr Gly Glu Asn Leu Trp Thr Trp Gin Phe 
755 760 765 



Thr Pro Phe Gin Lys Leu Asn Phe Thr Val Glu His Val Leu Lys Cys 
770 775 780 

Leu Val Glu Gly Asn Val He Gly Lys Gly Cys Ser Gly He Val Tyr 

785 790 795 800 

Lys Ala Glu Met Pro Asn Arg Glu Val He Ala Val Lys Lys Leu Trp 
805 810 815 



Pro Val Thr Val Pro Asn Leu Asn Glu Lys Thr Lys Ser Ser Gly Val 
820 825 830 

Arg Asp Ser Phe Ser Ala Glu Val Lys Thr Leu Gly Ser He Arg His 
835 840 845 
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Leu Leu Met Tyr Asp 'Tyr Met Ser Asn Gly Ser Leu Gly Ser Leu Leu 
865 " 870 875 880 

His Glu Arg Ser Gly Val Cys Ser Leu Gly Trp Glu Val Arg Tyr Lys 
885 890 895 

He He Leu Gly Ala Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys 
900 905 910 

Val Pro Pro He Val His Arg Asp He Lys Ala Asn Asn He Leu He 
915 920 925 

Gly Pro Asp Phe Glu Pro Tyr He Gly Asp Phe Gly Leu Ala Lys Leu 
930 935 940 

Val Asp Asp Gly Asp Phe Ala Arg Ser Ser Asn Thr He Ala Gly Ser 
945 950 955 960 

Tyr Gly Tyr He Ala Pro Glu Tyr Gly Tyr Ser Met Lys He Thr Glu 
965 970 975 

Lys Ser Asp Val Tyr Ser Tyr Gly Val Val Val Leu Glu Val Leu Thr 
980 985 990 

Gly Lys Gin Pro He Asp Pro Thr He Pro Asp Gly Leu His He Val 
995 1000 1005 

Asp Trp Val Lys Lys He Arg Asp He Gin Val He Asp Gin Gly 
1010 1015 ■ 1020 

Leu Gin Ala Arg Pro Glu Ser Glu Val Glu Glu Met Met Gin Thr 
1025 1030 1035 

Leu Gly Val Ala Leu Leu Cys He Asn Pro He Pro Glu Asp Arg 
1040 1045 1050 

Pro Thr Met Lys Asp Val Ala Ala Met Leu Ser Glu He Cys Gin 
1055 1060 1065 

Glu Arg Glu Glu Ser Met Lys Val Asp Gly Cys Ser Gly Ser Cys 
1070 1075 1080 



Asn Asn Gly Arg Glu Arg Gly Lys Asp Asp Ser Thr Ser Ser Val 
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Met Gin Gin Thr Ala Lys Tyr Leu Arg Ser Ser Ser Thr Ser Phe 
1100 1105 1110 

Ser Ala Ser Ser Leu Leu Tyr Ser Ser Ser Ser Ser Ala Thr Ser 
1115 1120 1125 



Asn Val Arg Pro Asn Leu Lys 
1130 1135 



<210> 5 
<211> 25 
<:212> PRT 

<213> Arabidopsis thaliana 
<:400> 5 

Arg Tyr Lys He He Leu Gly Ala Ala Gin Gly Leu Ala Tyr Leu His 



His Asp Cys Val Pro Pro He Val His 



<210> 6 
<211> 23 ■ 
<212> DNA 

<213> Arabidopsis thaliana 

<400> 6 

aayaayttma syggtkmdwt hcc 



<210> 7 
<211> 20 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 7 

carythamcg gwgaaathcc 



<210> 8 
<211> 20 
<212> DNA 

<213> Arabidopsis thaliana 

<400> 8 

mayn't savyg gahctathcc 



<210> 9 

<211> 22 

<212> DNA 

<213> Arabidopsis thaliana 



<220> 
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<221> !nisc_feature 
<222> (17).. (17) 

<223> n = a, c, g or t 



<40O> 9 
thccv/ttw 



<210> 10 
<211> 19 
<212> DNA 

<213> Arabidopsis thaliana 
<40O> 10 

aytcdggdgc hahgtadcc 



<210> 11 
<211> 21 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 11 

mayhccraar ctrtavacat c 



<210> 12 
<211> 19 
<212> DNA 

<213> Arabidopsis thaliana 
<40O> 12 

cgatcagaca caagaacat 



<21Q> 13 
<211> 19 
<2I2> DNA 

<213> Arabidopsis thaliana 
<40O> 13 

agcaatggtg ttggaagaa 



<210> 14 
<211> 19 
<212> DNA 

<213> Arabidopsis thi 
<400> 14 

tgcagatctt ygtgaagac 



<210> 15 

<211> 18 

<212> DMA 

<213> Arabidopsis thaliana 



<40 0> 15 

gactccttct ggatgttg 
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<400> 16 

tgtaaaacga cggccagt 18 

<210> 17 
<211> 34 
<212> DMA 

<213> Arabidopsis thaliana 
<400> 17 

cacacaggat ccaatcggca tttgcaaaga cata 34 

<210> 18 
<211> 842 
<212> DMA 

<213> Arabidopsis thaliana 
<400> 18 

aagcttaaat gacaatttag taccttgggt tggtcatgat ttagagcgga acaaatatac 60 

catacatcaa acgaggatat ficagagaaaa ttcatggaag tatggaattt agaggacaat 12 0 

ttctcttctg ggc!:.acaacg gaccggccca ttcgctcatt tacccagagg tatcgagttt 180 

gtggactttt gatgccgcta gagactattg gcatcggatt gaaaaaaatg tttacttcgt 240 

tgttaacaat tttctgaatg caatattttc cttgtcatga atatttaaac ttgttattac 300 

tttcttttag cttaggtgtg gacaattatg gagtttactt caaacgagga agaatcttaa 360 

acgctcggtt caggtctcga aaacaaacca actcacaatc ctgacttaat tgaggaaaac 420 

aatgcaaaac cacatgcatg cttccatatt tctatcataa tcttataaga aaaaacacta 480 

ctaagtgaaa tgattctgta tatatataac caatgccttt tgttttgtga tattttatgt 540 

atatataact attgactttt gtcatctatg gatagtgtct cgggctcttg gcaaacatat 600 

ttcaaagaaa agttaatgac tgtaattaat taatctgaag ctagaaacag aaccccgagg 660 

taaaagaaaa agacagagca catgaagttt agtactttta tatatttaat atatcattct 720 

ttcttattgc ttatctctaa agcaaaaact tccctaaacc ctaagccaaa ggactcagat 780 

cgatgcagaa ccaagaaggc ttgttttgga tttgagagcc aaatgcaaag aaaaaaactc 840 

tt 842 

<210> 19 
<211> 113 0 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 19 

Met Pro lie Pro Arg Lys Lys Ala Leu Thr Val Ser His Phe Ser lie 
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Thr Leu Ser Leu Phe Leu Ala Phe Phe lie Ser Ser Thr Ser Ala Ser 
20 25 30 

Thr Asn Glu Val Ser Ala Leu lie Ser Trp Leu His Ser Ser Asn Ser 



Pro Pro Pro Ser Val Phe Ser Gly Trp Asn Pro Ser Asp Ser Asp Pro 
50 55 60 

Cys Gin Trp Pro Tyr lie Thr Cys Ser Ser Pro Asp Asn Lys Leu Val 



Thr Glu lie Asn Val Val Ser Val Gin Leu Ala Leu Pro Phe Pro Pro 
85 90 95 

Asn lie Ser Ser Phe Thr Ser Leii Gin Lys Leu Val lie Ser Asn Thr 
100 105 110 



Asn Leu Thr Gly Ala lie Ser Ser Glu lie Gly Asp Cys Ser Glu Leu 

115 120 125 

lie Val He Asp Leu Ser Ser Asn Ser Leu Val Gly Glu lie Pro Ser 
130 135 140 

Ser Leu Gly Lys Leu Lys Asn Leu Gin Glu Leu Cys Leu Asn Ser Asn 
145 150 155 160 



Gly Leu Thr Gly Lys lie Pro Pro Glu Leu Gly Asp Cys Val Ser Leu 
165 170 175 

Lys Asn Leu Glu lie Phe Asp Asn Tyr Leu Ser Glu Asn Leu Pro Leu 
180 185 190 

Glu Leu Gly Lys lie Ser Thr Leu Glu Ser He Arg Ala Gly Gly Asn 
195 200 205 



Ser Glu Leu Ser Gly Lys lie Pro Glu Glu He Gly Asn Cys Arg Asn 
210 215 220 



Leu Lys Val Leu Gly Leu Ala Ala Thr Lys He Ser Gly Ser Leu Pro 

225 230 235 240 

Val Ser Leu Gly Gin Leu Ser Lys Leu Gin Ser Leu Phe Val Tyr Ser 
245 250 255 
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Ser Gly Glu lie Pro Lys Glu Leu Gly Asn Cys Ser Glu 
250 265 270 



Leu !'. ' A..:. Leu Phe Leu Tyr Asp Asn Asp Leu Ser Gly Thr Leu Pro 
280 285 . 



Gly Lys Leu Gin Asn Leu Glu Lys Met Leu Leu Trp Gin 
295 300 



•J His Gly Pro lie Pro Glu Glu He Gly Phe Met Lys Ser 
310 315 320 



Lc'u Alii He Asp Leu Ser Met Asn Tyr Phe Ser Gly Thr He Pro 

325 330 335 



Ly:; i -r w-- Gly Asn Leu Ser Asn Leu Gin Glu Leu Met Leu Ser Ser 
340 345 350 

Aar; Ai r, ; 1 f • Thr Gly Ser He Pro Ser He Leu Ser Asp Cys Thr Lys 

I'S 360 365 

Leu ■,.•»;"'■;.:. Phe Gin He Asp Ala Asn Gin He Ser Gly Leu He Pro 
VO 375 380 



Piu :\& Gly Leu TJeu Lys Glu Leu Asn He Phe Gly Trp Gin Asn 

2Bb 390 395 400 

Ly;i . Gly Asn He Pro Asp Glu Leu Ala Gly Cys Gin Asn Leu 

405 410 415 

Gin K'.n l.*u Asp Leu Ser Gin Asn Tyr Leu Thr Gly Ser Leu Pro Ala 
420 425 430 



Giy ".-.-w f^i'j Gin Leu Arg Asn Leu Thr Lys Leu Leu Leu He Ser Asn 
li^. 440 445 

Alt* ::<■ Jcr Gly Val He Pro Leu Glu Thr Gly Asn Cys Thr Ser Leu 
4S- 455 460 



Val \: J L.'-- Arg Leu Val Asn Asn Arg He Thr Gly Glu He Pro Lys 
46b 470 475 480 

Gly :i. Phe Leu Gin Asn Leu Ser Phe Leu Asp Leu Ser Glu Asn 

485 490 495 

Asr. J "- r Gly Pro Val Pro Leu Glu He Ser Asn Cys Arg Gin Leu 
500 505 510 
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Gin Met Leu Asn Leu Ser Asn Asn Thr Leu Gin Gly Tyr Leu Pro Leu 
515 520 525 



Ser Leu Ser Ser Leu Thr Lys Leu Gin Val Leu Asp Val Ser Ser Asn 
530 535 540 



Asp Leu Thr Gly Lys lie Pro Asp Ser Leu Gly His Leu lie Ser Leu 
545 550 555 560 

Asn Arg Leu lie Leu Ser Lys Asn Ser Phe Asn Gly Glu lie Pro Ser 

565 570 575 



Ser Leu Gly His Cys Thr Asn Leu Gin Leu Leu Asp Leu Ser Ser Asn 
580 585 590 



Asn lie Ser Gly Thr lie Pro Glu Glu Leu Phe Asp lie Gin Asp Leu 
595 600 605 



Glu Arg lie Ser Ala Leu Asn Arg Leu Ser Val Leu Asp lie Ser His 
625 630 635 640 

Asn Met Leu Ser Gly Asp Leu Ser Ala Leu Ser Gly Leu Glu Asn Leu 
645 650 655 

Val Ser Leu Asn He Ser His Asn Arg Phe Ser Gly Tyr Leu Pro Asp 
660 665 670 

Ser Lys Val Phe Arg Gin Leu lie Gly Ala Glu Met Glu Gly Asn Asn 
675 680 685 

Gly Leu Cys Ser Lys Gly Phe Arg Ser Cys Phe Val Ser Asn Ser Ser 
690 695 700 



Gin Leu Thr Thr Gin Arg Gly Val His Ser His Arg Leu Arg lie Ala 
705 710 715 720 

lie Gly Leu Leu He Ser Val Thr Ala Val Leu Ala Val Leu Gly Val 
725 730 735 

Leu Ala Val He Arg Ala Lys Gin Met He Arg Asp Asp Asn Aso Ser 
740 745 750 

Glu Thr Gly Glu Asn Leu Trp Thr Trp Gin Phe Thr Pro Phe Gin Lys 
755 760 755 
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Leu Asn Phe Thr Val Glu His Val Leu Lys Cys Leu Val Glu Gly Asn 
770 775 780 

Val lie Gly Lys Gly Cys Ser Gly lie Val Tyr Lys Ala Glu Met Pro 
785 790 795 800 

Asn Arg Glu Val lie Ala Val Lys Lys Leu Trp Pro Val Thr Val Pro 
805 810 815 

Asn Leu Asn Glu Lys Tlrr Lys Ser Ser Gly Val Arg Asp Ser Phe Ser 
820 825 830 

Ala Glu Val Lys Thr Leu Gly Ser lie Arg His Lys Asn lie Val Arg 
835 840 845 

Phe Leu Gly Cys Cys Trp Asn Lys Asn Thr Arg Leu Leu Met Tyr Asp 
850 855 860 

Tyr Met Ser Asn Gly Ser Leu Gly Ser Leu Leu His Glu Arg Ser Gly 
865 870 875 880 

Val Cys Ser Leu Gly Trp Glu Val Arg Tyr Lys lie He Leu Gly Ala 
885 890 895 

Ala Gin Gly Leu Ala Tyr Leu His His Asp Cys Val Pro Pro He Val 
900 905 910 

His Arg Asp He Lys Ala Asn Asn He Leu He Gly Pro Aso Phe Glu 
915 920 925 

Pro Tyr He Gly Asp Phe Gly Leu Ala Lys Leu Val Asp Asp Gly Asp 
930 935 940 

Phe Ala Arg Ser Ser Asn Thr He A.la Gly Ser Tyr Gly Tyx He Ala 
945 950 955 960 

Pro Glu Tyr Gly Tyr Ser Met Lys He Thr Glu Lys Ser Asp Val Tyr 
965 970 975 

Ser Tyr Gly Val Val Val Leu Glu Val Leu Thr Gly Lys Gin Pro He 
980 985 990 

Asp Pro Thr He Pro Asp Gly Leu His He Val Asp Trp Val Lys Lys 
995 1000 1005 



He Arg Asp He Gin Val He Asp Gin Gly Leu Gin Ala Arg Pro 
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1015 1020 



Cixi Val Glu Glu Met Met Gin Thr Leu Gly Val Ala Leu 
1030 1035 



Lej lo Asn Pro lie Pro Glu Asp Arg Pro Thr Met Lys Asp 

. : ,■■ 1045 1050 



Val A Ala Met Leu Ser Glu He Cys Gin Glu Arg Glu Glu Ser 
1' 1060 1065 



MC. ; V.il Asp Gly Cys Ser Gly Ser Cys Asn Asn Gly Arg Glu 

:^3C) 1075 1080 

Ar cj y l./ti Asp Asp Ser Thr Ser Ser Val Met Gin Gin Thr Ala 

J.090 1095 

Lys T-, r L«u Arg Ser Ser Ser Thr Ser Phe Ser Ala Ser Ser Leu 

r.oo 1105 1110 

Le-, -r-.-r ;:.*r Ser Ser Ser Ser Ala Thr Ser Asn Val Arg Pro Asn 

i::'. 1120 1125 
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